Global and Local Texture Randomization for Synthetic-to-Real Semantic Segmentation

Peng, Duo and Lei, Yinjie and Liu, Lingqiao and Zhang, Pingping and Liu, Jun (2021) Global and Local Texture Randomization for Synthetic-to-Real Semantic Segmentation. IEEE Transactions on Image Processing, 30. pp. 6594-6608. ISSN 1057-7149

Full text not available from this repository.

Abstract

Semantic segmentation is a crucial image understanding task in which each pixel of an image is assigned a corresponding label. Because pixel-wise ground-truth labeling is tedious and labor-intensive, many works train models on synthetic images for real-world image semantic segmentation, i.e., Synthetic-to-Real Semantic Segmentation (SRSS). However, Deep Convolutional Neural Networks (CNNs) trained on the source synthetic data may not generalize well to the target real-world data. To address this problem, there has been rapidly growing interest in Domain Adaptation techniques that mitigate the domain mismatch between synthetic and real-world images. Domain Generalization is another way to handle SRSS. In contrast to Domain Adaptation, Domain Generalization seeks to address SRSS without accessing any target-domain data during training. In this work, we propose two simple yet effective texture randomization mechanisms, Global Texture Randomization (GTR) and Local Texture Randomization (LTR), for Domain Generalization based SRSS. GTR randomizes the texture of source images into diverse unreal texture styles. It aims to reduce the network's reliance on texture while promoting the learning of domain-invariant cues. In addition, we find that texture differences do not always occur across the entire image and may appear only in some local areas. Therefore, we further propose an LTR mechanism that generates diverse local regions for partially stylizing the source images. Finally, we implement a regularization of Consistency between GTR and LTR (CGL) to harmonize the two proposed mechanisms during training. Extensive experiments on five publicly available datasets (i.e., GTA5, SYNTHIA, Cityscapes, BDDS and Mapillary) with various SRSS settings (i.e., GTA5/SYNTHIA to Cityscapes/BDDS/Mapillary) demonstrate that the proposed method is superior to the state-of-the-art methods for domain generalization based SRSS.
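
The abstract does not give implementation details, so the following is only a toy sketch of the general idea of global versus local restyling, not the authors' method. It assumes a stylized copy of the source image is already available, uses a hypothetical random block mask to pick the local regions, and uses a mean squared difference as a stand-in for the consistency term. All function names (`random_block_mask`, `blend`, `mean_squared_difference`) are illustrative and do not come from the paper.

```python
import random


def random_block_mask(height, width, rng, block=2, p=0.5):
    """Binary mask built from block x block cells, each switched on with probability p.

    Assumption: the paper's actual region generation is not described in the abstract.
    """
    rows = (height + block - 1) // block
    cols = (width + block - 1) // block
    cells = [[1.0 if rng.random() < p else 0.0 for _ in range(cols)] for _ in range(rows)]
    return [[cells[y // block][x // block] for x in range(width)] for y in range(height)]


def blend(source, stylized, mask):
    """Per pixel: stylized value where the mask is 1, source value where it is 0."""
    return [
        [m * t + (1.0 - m) * s for s, t, m in zip(src_row, sty_row, mask_row)]
        for src_row, sty_row, mask_row in zip(source, stylized, mask)
    ]


def mean_squared_difference(a, b):
    """Stand-in consistency penalty between two same-sized maps (an assumption)."""
    diffs = [(x - y) ** 2 for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b)]
    return sum(diffs) / len(diffs)


rng = random.Random(0)
h, w = 4, 6
source = [[0.2] * w for _ in range(h)]    # toy grayscale "synthetic" image
stylized = [[0.9] * w for _ in range(h)]  # toy stand-in for a texture-stylized copy

# Global view: the whole image takes the new texture.
full_mask = [[1.0] * w for _ in range(h)]
global_view = blend(source, stylized, full_mask)

# Local view: only the randomly selected regions take the new texture.
local_mask = random_block_mask(h, w, rng)
local_view = blend(source, stylized, local_mask)

# In training, both views would go through the segmentation network and the
# penalty would compare its two outputs; it is applied to the images here
# only to show the call.
penalty = mean_squared_difference(global_view, local_view)
```

In this sketch the global view is simply the local view with an all-ones mask, which makes the relationship between the two kinds of restyling explicit. A real pipeline would replace the constant `stylized` image with the output of a style-transfer step.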

Item Type:
Journal Article
Journal or Publication Title:
IEEE Transactions on Image Processing
Uncontrolled Keywords:
/dk/atira/pure/subjectarea/asjc/1700/1704
Subjects:
computer graphics and computer-aided design; software
ID Code:
222882
Deposited By:
Deposited On:
09 Aug 2024 15:15
Refereed?:
Yes
Published?:
Published
Last Modified:
13 Aug 2024 23:52