Instance-Level Multitask Learning for 3D Building Extraction from Monocular Off-Nadir Satellite Sensor Imagery

Shi, Wenxu and Meng, Qingyan and Zhang, Linlin and Zhao, Maofan and Su, Chen and Guo, Guinan and Li, Ting and Atkinson, Peter M. (2025) Instance-Level Multitask Learning for 3D Building Extraction from Monocular Off-Nadir Satellite Sensor Imagery. IEEE Transactions on Geoscience and Remote Sensing, 63. ISSN 0196-2892

Full text not available from this repository.

Abstract

Extracting 3D building information from monocular satellite sensor imagery remains a formidable challenge in the field of remote sensing. Multitask frameworks based on deep learning, which particularly for simultaneously predicting 2D building outlines and their respective heights using ortho-rectified satellite imagery, have shown promise in addressing this challenge. Height estimation is notably complex due to the absence of explicit height indicators, limited interaction between semantic-height features, and inadequate representation of building relationships. Moreover, the availability of data sources is a limiting factor for broader application. To overcome these issues, this study introduces an innovative instance-level multitask learning model (named BDH-Net) that leverages off-nadir perspectives and roof-to-footprint offset vectors to enhance modeling. This model comprises four key components: a pixel-wise feature extraction image encoder-decoder, a query transformer decoder, a multitask decoder, and a height decoder that employs intra-instance and inter-instance attention for precise building height estimation. Additionally, we pioneer the use of Google Earth imagery to construct an off-nadir satellite dataset with roof-to-footprint offset vectors specifically designed for building instance segmentation and height prediction, known as the BDH dataset. Comprehensive experiments demonstrate that the proposed BDH-Net significantly improves the accuracy of monocular 3D building data extraction by integrating roof-to-footprint offset vectors and leveraging context specific to each building instance. With the extensive coverage and regular updates of Google Earth imagery, BDH-Net holds substantial potential for wide-ranging and long-term applications. The source code of the proposed BDH-Net and the BDH dataset are publicly available at https://github.com/wishx98/BDHNet.

Item Type:
Journal Article
Journal or Publication Title:
IEEE Transactions on Geoscience and Remote Sensing
Uncontrolled Keywords:
/dk/atira/pure/subjectarea/asjc/2200/2208
Subjects:
?? electrical and electronic engineeringearth and planetary sciences(all) ??
ID Code:
235864
Deposited By:
Deposited On:
06 Mar 2026 15:20
Refereed?:
Yes
Published?:
Published
Last Modified:
06 Mar 2026 23:10