Object-Based Semi-Supervised Spatial Attention Residual UNet for Urban High-Resolution Remote Sensing Image Classification

Lu, Yuanbing and Li, Huapeng and Zhang, Ce and Zhang, Shuqing (2024) Object-Based Semi-Supervised Spatial Attention Residual UNet for Urban High-Resolution Remote Sensing Image Classification. Remote Sensing, 16 (8): 1444. ISSN 2072-4292

Full text not available from this repository.

Abstract

Accurate urban land cover information is crucial for effective urban planning and management. While convolutional neural networks (CNNs) demonstrate superior feature learning and prediction capabilities using image-level annotations, the inherent mixed-category nature of input image patches leads to classification errors along object boundaries. Fully convolutional neural networks (FCNs) excel at pixel-wise fine segmentation, making them less susceptible to heterogeneous content, but they require fully annotated dense image patches, which may not be readily available in real-world scenarios. This paper proposes an object-based semi-supervised spatial attention residual UNet (OS-ARU) model. First, multiscale segmentation is performed to obtain segments from a remote sensing image, and segments containing sample points are assigned the categories of the corresponding points, which are used to train the model. Then, the trained model predicts class probabilities for all segments. Each unlabeled segment’s probability distribution is compared against those of labeled segments for similarity matching under a threshold constraint. Through label propagation, pseudo-labels are assigned to unlabeled segments exhibiting high similarity to labeled ones. Finally, the model is retrained using the augmented training set incorporating the pseudo-labeled segments. Comprehensive experiments on aerial image benchmarks for Vaihingen and Potsdam demonstrate that the proposed OS-ARU achieves higher classification accuracy than state-of-the-art models, including OCNN, 2OCNN, and standard OS-U, reaching an overall accuracy (OA) of 87.83% and 86.71%, respectively. The performance improvements over the baseline methods are statistically significant according to the Wilcoxon Signed-Rank Test. Despite using significantly fewer sparse annotations, this semi-supervised approach still achieves comparable accuracy to the same model under full supervision. The proposed method thus makes a step forward in substantially alleviating the heavy sampling burden of FCNs (densely sampled deep learning models) to effectively handle the complex issue of land cover information identification and classification.

Item Type:

Journal Article

Journal or Publication Title:

Remote Sensing

Uncontrolled Keywords:

/dk/atira/pure/subjectarea/asjc/1900/1900

Subjects:

?? cnnobiaunetsemi-supervisedsemantic segmentationclassificationgeneral earth and planetary sciencesearth and planetary sciences(all) ??

Departments:

Faculty of Science and Technology > Lancaster Environment Centre

ID Code:

218987

Deposited By:

ep_importer_pure

Deposited On:

29 Apr 2024 10:15

Refereed?:

Yes

Published?:

Published

Last Modified:

08 May 2025 04:51

URI:

https://eprints.lancs.ac.uk/id/eprint/218987