Few-shot 3D Point Cloud Segmentation via Relation Consistency-guided Heterogeneous Prototypes

Wei, L. and Lang, C. and Xu, Z. and Liang, L. and Liu, J. (2025) Few-shot 3D Point Cloud Segmentation via Relation Consistency-guided Heterogeneous Prototypes. IEEE Transactions on Multimedia. ISSN 1520-9210

[thumbnail of TMM_final_file]
Text (TMM_final_file)
TMM_final_file.pdf - Accepted Version
Available under License Creative Commons Attribution.

Download (4MB)

Abstract

Few-shot 3D point cloud semantic segmentation is a challenging task due to the lack of labeled point clouds (support set). To segment unlabeled query point clouds, existing prototype-based methods learn 3D prototypes from point features of the support set and then measure their distances to the query points. However, such homogeneous 3D prototypes are often of low quality because they overlook the valuable heterogeneous information buried in the support set, such as semantic labels and projected 2D depth maps. To address this issue, in this paper, we propose a novel Relation Consistency-guided Heterogeneous Prototype learning framework (RCHP), which improves prototype quality by integrating heterogeneous information using large multi-modal models (e.g. CLIP). RCHP achieves this through two core components: Heterogeneous Prototype Generation module which collaborates with 3D networks and CLIP to generate heterogeneous prototypes, and Heterogeneous Prototype Fusion module which effectively fuses heterogeneous prototypes to obtain high-quality prototypes. Furthermore, to bridge the gap between heterogeneous prototypes, we introduce a Heterogeneous Relation Consistency loss, which transfers more reliable inter-class relations (i.e., inter-prototype relations) from refined prototypes to heterogeneous ones. Extensive experiments conducted on five point cloud segmentation datasets, including four indoor datasets (S3DIS, ScanNet, SceneNN, NYU Depth V2) and one outdoor dataset (Semantic3D), demonstrate the superiority and generalization capability of our method, outperforming state-of-the-art approaches across all datasets. The code will be released as soon as the paper is accepted.

Item Type:
Journal Article
Journal or Publication Title:
IEEE Transactions on Multimedia
Uncontrolled Keywords:
/dk/atira/pure/subjectarea/asjc/2200/2214
Subjects:
?? media technologysignal processingcomputer science applicationselectrical and electronic engineering ??
ID Code:
229247
Deposited By:
Deposited On:
07 May 2025 09:15
Refereed?:
Yes
Published?:
Published
Last Modified:
16 May 2025 01:50