Building extraction from VHR remote sensing imagery by combining an improved deep convolutional encoder-decoder architecture and historical land use vector map

Feng, Wenqing; Sui, Haigang; Hua, Li; Xu, Chuan; Ma, Guorui; Huang, Weiming

Building extraction from VHR remote sensing imagery by combining an improved deep convolutional encoder-decoder architecture and historical land use vector map

Mark

Feng, Wenqing ; Sui, Haigang ; Hua, Li ; Xu, Chuan ; Ma, Guorui and Huang, Weiming ^LU (2020) In International Journal of Remote Sensing 41(17). p.6595-6617

Abstract: Building extraction has attracted considerable attention in the field of remote sensing image analysis. Fully convolutional network modelling is a recently developed technique that is capable of significantly enhancing building extraction accuracy. It is a prominent branch of deep learning and uses advanced state-of-the-art techniques, especially with regard to building segmentation. In this paper, we present an enhanced deep convolutional encoder-decoder (DCED) network by incorporating historical land use vector maps (HVMs) customized for building extraction. The approach combines enhanced DCED architecture with multi-scale image pyramid for pixel-wise building segmentation. The improved DCED network, together with symmetrical... (More); Building extraction has attracted considerable attention in the field of remote sensing image analysis. Fully convolutional network modelling is a recently developed technique that is capable of significantly enhancing building extraction accuracy. It is a prominent branch of deep learning and uses advanced state-of-the-art techniques, especially with regard to building segmentation. In this paper, we present an enhanced deep convolutional encoder-decoder (DCED) network by incorporating historical land use vector maps (HVMs) customized for building extraction. The approach combines enhanced DCED architecture with multi-scale image pyramid for pixel-wise building segmentation. The improved DCED network, together with symmetrical dense-shortcut connection structures, is employed to establish the encoders for automatic extraction of building features. The feature maps from early layers were fused with more discriminative feature maps from the deeper layers through ‘Res path’ skip connections for superior building extraction accuracy. To further reduce the occurrence of falsely segmented buildings, and to sharpen the buildings’ boundaries, the new temporal testing image is segmented under the constraints of an HVM. A majority voting strategy is employed to ensure the homogeneity of the building objects as the post-processing method. Experimental results indicate that the proposed approach exhibits competitive quantitative and qualitative performance, effectively alleviating the salt-and-pepper phenomenon and block effects, and retaining the edge structures of buildings. Compared with other state-of-the-art methods, our method demonstrably achieves the optimal final accuracies.
(Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/4b3c66d6-5611-46f3-9409-ea6f4c360128

author

Feng, Wenqing ; Sui, Haigang ; Hua, Li ; Xu, Chuan ; Ma, Guorui and Huang, Weiming ^LU

organization

Dept of Physical Geography and Ecosystem Science

publishing date

2020

type

Contribution to journal

publication status

published

subject

Multidisciplinary Geosciences

in

International Journal of Remote Sensing

volume

41

issue

17

pages

23 pages

publisher

Taylor & Francis

external identifiers

scopus:85087456817

ISSN

0143-1161

DOI

10.1080/01431161.2020.1742944

language

English

LU publication?

yes

id

4b3c66d6-5611-46f3-9409-ea6f4c360128

date added to LUP

2020-07-15 09:49:32

date last changed

2025-04-04 15:06:10

@article{4b3c66d6-5611-46f3-9409-ea6f4c360128,
  abstract     = {{<p>Building extraction has attracted considerable attention in the field of remote sensing image analysis. Fully convolutional network modelling is a recently developed technique that is capable of significantly enhancing building extraction accuracy. It is a prominent branch of deep learning and uses advanced state-of-the-art techniques, especially with regard to building segmentation. In this paper, we present an enhanced deep convolutional encoder-decoder (DCED) network by incorporating historical land use vector maps (HVMs) customized for building extraction. The approach combines enhanced DCED architecture with multi-scale image pyramid for pixel-wise building segmentation. The improved DCED network, together with symmetrical dense-shortcut connection structures, is employed to establish the encoders for automatic extraction of building features. The feature maps from early layers were fused with more discriminative feature maps from the deeper layers through ‘Res path’ skip connections for superior building extraction accuracy. To further reduce the occurrence of falsely segmented buildings, and to sharpen the buildings’ boundaries, the new temporal testing image is segmented under the constraints of an HVM. A majority voting strategy is employed to ensure the homogeneity of the building objects as the post-processing method. Experimental results indicate that the proposed approach exhibits competitive quantitative and qualitative performance, effectively alleviating the salt-and-pepper phenomenon and block effects, and retaining the edge structures of buildings. Compared with other state-of-the-art methods, our method demonstrably achieves the optimal final accuracies.</p>}},
  author       = {{Feng, Wenqing and Sui, Haigang and Hua, Li and Xu, Chuan and Ma, Guorui and Huang, Weiming}},
  issn         = {{0143-1161}},
  language     = {{eng}},
  number       = {{17}},
  pages        = {{6595--6617}},
  publisher    = {{Taylor & Francis}},
  series       = {{International Journal of Remote Sensing}},
  title        = {{Building extraction from VHR remote sensing imagery by combining an improved deep convolutional encoder-decoder architecture and historical land use vector map}},
  url          = {{http://dx.doi.org/10.1080/01431161.2020.1742944}},
  doi          = {{10.1080/01431161.2020.1742944}},
  volume       = {{41}},
  year         = {{2020}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

Building extraction from VHR remote sensing imagery by combining an improved deep convolutional encoder-decoder architecture and historical land use vector map