Building extraction from VHR remote sensing imagery by combining an improved deep convolutional encoder-decoder architecture and historical land use vector map
(2020) In International Journal of Remote Sensing 41(17). p.6595-6617- Abstract
Building extraction has attracted considerable attention in the field of remote sensing image analysis. Fully convolutional network modelling is a recently developed technique that is capable of significantly enhancing building extraction accuracy. It is a prominent branch of deep learning and uses advanced state-of-the-art techniques, especially with regard to building segmentation. In this paper, we present an enhanced deep convolutional encoder-decoder (DCED) network by incorporating historical land use vector maps (HVMs) customized for building extraction. The approach combines enhanced DCED architecture with multi-scale image pyramid for pixel-wise building segmentation. The improved DCED network, together with symmetrical... (More)
Building extraction has attracted considerable attention in the field of remote sensing image analysis. Fully convolutional network modelling is a recently developed technique that is capable of significantly enhancing building extraction accuracy. It is a prominent branch of deep learning and uses advanced state-of-the-art techniques, especially with regard to building segmentation. In this paper, we present an enhanced deep convolutional encoder-decoder (DCED) network by incorporating historical land use vector maps (HVMs) customized for building extraction. The approach combines enhanced DCED architecture with multi-scale image pyramid for pixel-wise building segmentation. The improved DCED network, together with symmetrical dense-shortcut connection structures, is employed to establish the encoders for automatic extraction of building features. The feature maps from early layers were fused with more discriminative feature maps from the deeper layers through ‘Res path’ skip connections for superior building extraction accuracy. To further reduce the occurrence of falsely segmented buildings, and to sharpen the buildings’ boundaries, the new temporal testing image is segmented under the constraints of an HVM. A majority voting strategy is employed to ensure the homogeneity of the building objects as the post-processing method. Experimental results indicate that the proposed approach exhibits competitive quantitative and qualitative performance, effectively alleviating the salt-and-pepper phenomenon and block effects, and retaining the edge structures of buildings. Compared with other state-of-the-art methods, our method demonstrably achieves the optimal final accuracies.
(Less)
- author
- Feng, Wenqing ; Sui, Haigang ; Hua, Li ; Xu, Chuan ; Ma, Guorui and Huang, Weiming LU
- organization
- publishing date
- 2020
- type
- Contribution to journal
- publication status
- published
- subject
- in
- International Journal of Remote Sensing
- volume
- 41
- issue
- 17
- pages
- 23 pages
- publisher
- Taylor & Francis
- external identifiers
-
- scopus:85087456817
- ISSN
- 0143-1161
- DOI
- 10.1080/01431161.2020.1742944
- language
- English
- LU publication?
- yes
- id
- 4b3c66d6-5611-46f3-9409-ea6f4c360128
- date added to LUP
- 2020-07-15 09:49:32
- date last changed
- 2022-08-22 15:15:39
@article{4b3c66d6-5611-46f3-9409-ea6f4c360128, abstract = {{<p>Building extraction has attracted considerable attention in the field of remote sensing image analysis. Fully convolutional network modelling is a recently developed technique that is capable of significantly enhancing building extraction accuracy. It is a prominent branch of deep learning and uses advanced state-of-the-art techniques, especially with regard to building segmentation. In this paper, we present an enhanced deep convolutional encoder-decoder (DCED) network by incorporating historical land use vector maps (HVMs) customized for building extraction. The approach combines enhanced DCED architecture with multi-scale image pyramid for pixel-wise building segmentation. The improved DCED network, together with symmetrical dense-shortcut connection structures, is employed to establish the encoders for automatic extraction of building features. The feature maps from early layers were fused with more discriminative feature maps from the deeper layers through ‘Res path’ skip connections for superior building extraction accuracy. To further reduce the occurrence of falsely segmented buildings, and to sharpen the buildings’ boundaries, the new temporal testing image is segmented under the constraints of an HVM. A majority voting strategy is employed to ensure the homogeneity of the building objects as the post-processing method. Experimental results indicate that the proposed approach exhibits competitive quantitative and qualitative performance, effectively alleviating the salt-and-pepper phenomenon and block effects, and retaining the edge structures of buildings. Compared with other state-of-the-art methods, our method demonstrably achieves the optimal final accuracies.</p>}}, author = {{Feng, Wenqing and Sui, Haigang and Hua, Li and Xu, Chuan and Ma, Guorui and Huang, Weiming}}, issn = {{0143-1161}}, language = {{eng}}, number = {{17}}, pages = {{6595--6617}}, publisher = {{Taylor & Francis}}, series = {{International Journal of Remote Sensing}}, title = {{Building extraction from VHR remote sensing imagery by combining an improved deep convolutional encoder-decoder architecture and historical land use vector map}}, url = {{http://dx.doi.org/10.1080/01431161.2020.1742944}}, doi = {{10.1080/01431161.2020.1742944}}, volume = {{41}}, year = {{2020}}, }