Multi-class strategies for joint building footprint and road detection in remote sensing

dc.contributor.authorAyala Lauroba, Christian
dc.contributor.authorAranda, Carlos
dc.contributor.authorGalar Idoate, Mikel
dc.contributor.departmentInstitute of Smart Cities - ISCen
dc.contributor.funderGobierno de Navarra / Nafarroako Gobernua, 0011-1408-2020-000008es
dc.date.accessioned2022-01-12T08:17:14Z
dc.date.available2022-01-12T08:17:14Z
dc.date.issued2021
dc.description.abstractBuilding footprints and road networks are important inputs for a great deal of services. For instance, building maps are useful for urban planning, whereas road maps are essential for disaster response services. Traditionally, building and road maps are manually generated by remote sensing experts or land surveying, occasionally assisted by semi-automatic tools. In the last decade, deep learning-based approaches have demonstrated their capabilities to extract these elements automatically and accurately from remote sensing imagery. The building footprint and road network detection problem can be considered a multi-class semantic segmentation task, that is, a single model performs a pixel-wise classification on multiple classes, optimizing the overall performance. However, depending on the spatial resolution of the imagery used, both classes may coexist within the same pixel, drastically reducing their separability. In this regard, binary decomposition techniques, which have been widely studied in the machine learning literature, are proved useful for addressing multiclass problems. Accordingly, the multi-class problem can be split into multiple binary semantic segmentation sub-problems, specializing different models for each class. Nevertheless, in these cases, an aggregation step is required to obtain the final output labels. Additionally, other novel approaches, such as multi-task learning, may come in handy to further increase the performance of the binary semantic segmentation models. Since there is no certainty as to which strategy should be carried out to accurately tackle a multi-class remote sensing semantic segmentation problem, this paper performs an in-depth study to shed light on the issue. For this purpose, open-access Sentinel-1 and Sentinel-2 imagery (at 10 m) are considered for extracting buildings and roads, making use of the well-known U-Net convolutional neural network. It is worth stressing that building and road classes may coexist within the same pixel when working at such a low spatial resolution, setting a challenging problem scheme. Accordingly, a robust experimental study is developed to assess the benefits of the decomposition strategies and their combination with a multi-task learning scheme. The obtained results demonstrate that decomposing the considered multi-class remote sensing semantic segmentation problem into multiple binary ones using a One-vs-All binary decomposition technique leads to better results than the standard direct multi-class approach. Additionally, the benefits of using a multi-task learning scheme for pushing the performance of binary segmentation models are also shown.en
dc.description.sponsorshipChristian Ayala was partially supported by the Goverment of Navarra under the industrial PhD program 2020 reference 0011-1408-2020-000008. Mikel Galar was partially supported by Tracasa Instrumental S.L. under projects OTRI 2018-901-073, OTRI 2019-901-091 and OTRI 2020-901-050, and by the Spanish MICIN (PID2019-108392GB-I00 / AEI / 10.13039/501100011033).en
dc.format.extent18 p.
dc.format.mimetypeapplication/pdfen
dc.identifier.doi10.3390/app11188340
dc.identifier.issn2076-3417
dc.identifier.urihttps://academica-e.unavarra.es/handle/2454/41735
dc.language.isoengen
dc.publisherMDPIen
dc.relation.ispartofApplied Sciences, 11 (18), 8340en
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-108392GB-I00/ES/
dc.relation.publisherversionhttps://doi.org/10.3390/app11188340
dc.rights© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.en
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectSentinel-1en
dc.subjectSentinel-2en
dc.subjectRemote sensingen
dc.subjectBuilding detectionen
dc.subjectRoad detectionen
dc.subjectDeep learningen
dc.subjectConvolutional neural networksen
dc.subjectMulti-class semantic segmentationen
dc.subjectBinary semantic segmentationen
dc.subjectMulti-task semantic segmentationen
dc.titleMulti-class strategies for joint building footprint and road detection in remote sensingen
dc.typeinfo:eu-repo/semantics/article
dc.type.versioninfo:eu-repo/semantics/publishedVersion
dspace.entity.typePublication
relation.isAuthorOfPublication4c0a0a12-02e3-479d-8562-b5d9a39bab40
relation.isAuthorOfPublication44c7a308-9c21-49ef-aa03-b45c2c5a06fd
relation.isAuthorOfPublication.latestForDiscovery4c0a0a12-02e3-479d-8562-b5d9a39bab40

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ayala_MultiClass.pdf
Size:
3.84 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: