A Self-Training Hierarchical Prototype-based Ensemble Framework for Remote Sensing Scene Classification

Gu, Xiaowei and Zhang, Ce and Shen, Qiang and Han, Jungong and Angelov, Plamen and Atkinson, Peter (2022) A Self-Training Hierarchical Prototype-based Ensemble Framework for Remote Sensing Scene Classification. Information Fusion, 80. pp. 179-204. ISSN 1566-2535

[thumbnail of STHPensemble_final]
Text (STHPensemble_final)
STHPensemble_final.pdf - Accepted Version
Available under License Creative Commons Attribution-NonCommercial-NoDerivs.

Download (3MB)


Remote sensing scene classification plays a critical role in a wide range of real-world applications. Technically, however, scene classification is an extremely challenging task due to the huge complexity in remotely sensed scenes, and the difficulty in acquiring labelled data for model training such as supervised deep learning. To tackle these issues, a novel semi-supervised ensemble framework is proposed here using the self-training hierarchical prototype-based classifier as the base learner for chunk-by-chunk prediction. The framework has the ability to build a powerful ensemble model from both labelled and unlabelled images with minimum supervision. Different feature descriptors are employed in the proposed ensemble framework to offer multiple independent views of images. Thus, the diversity of base learners is guaranteed for ensemble classification. To further increase the overall accuracy, a novel cross-checking strategy was introduced to enable the base learners to exchange pseudo-labelling information during the self-training process, and maximize the correctness of pseudo-labels assigned to unlabelled images. Extensive numerical experiments on popular benchmark remote sensing scenes demonstrated the effectiveness of the proposed ensemble framework, especially where the number of labelled images available is limited. For example, the classification accuracy achieved on the OPTIMAL-31, PatternNet and RSI-CB256 datasets was up to 99.91%, 98. 67% and 99.07% with only 40% of the image sets used as labelled training images, surpassing or at least on par with mainstream benchmark approaches trained with double the number of labelled images.

Item Type:
Journal Article
Journal or Publication Title:
Information Fusion
Additional Information:
This is the author’s version of a work that was accepted for publication in Information Fusion. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Information Fusion, 80, 2021 DOI: 10.1016/j.inffus.2021.11.014
Uncontrolled Keywords:
?? self-trainingpseudo-labellingprototypesremote sensingscene classificationhardware and architecturesignal processingsoftwareinformation systems ??
ID Code:
Deposited By:
Deposited On:
23 Nov 2021 09:45
Last Modified:
19 Apr 2024 02:24