Unpaired 3D Shape-to-Shape Translation via Gradient-Guided Triplane Diffusion

Zhang, Wenxiao and Rahmani, Hossein and Liu, Jun (2025) Unpaired 3D Shape-to-Shape Translation via Gradient-Guided Triplane Diffusion. IEEE Transactions on Visualization and Computer Graphics. ISSN 1077-2626

[thumbnail of TVCG-2024-01-0036.R1_Proof_hi]
Text (TVCG-2024-01-0036.R1_Proof_hi)
TVCG-2024-01-0036.R1_Proof_hi.pdf - Accepted Version
Available under License Creative Commons Attribution.

Download (4MB)

Abstract

Unpaired shape-to-shape translation refers to the task of transforming the geometry and semantics of an input shape into a new shape domain without paired training data. Previous methods utilize GAN-based architectures to perform shape translation, employing adversarial training to transform the source shape encoding into the target domain in the low-dimensional latent feature space. However, these methods encounter difficulties in generating diverse and high-quality results, as they often suffer from issues such as “mode collapse”. This leads to limited generation diversity and makes it challenging to find an accurate latent code that adequately represents the input shape. In this paper, we achieve unpaired shape-to-shape translation via a triplane diffusion model, in which we factorize 3D objects into triplane representations and conduct a diffusion process on these representations to accomplish shape domain transformation. We observe that by adding an appropriate amount of noise to an input object during the forward diffusion process, domain-specific shape structures are smoothed out while the overall structure is still preserved. Subsequently, we progressively remove the noise via an unconditional diffusion model trained on the target shape domain in the reverse diffusion process. This allows us to obtain a denoised output that retains the structural similarities of the source input while aligning with the distribution of the target shape domain. During this process, we propose two gradient-based guidance mechanisms to guide the translation process to guarantee more faithful results during the denoising process. We conduct extensive experiments on different shape domains, and the experimental results demonstrate that our method achieves superior shape fidelity with high quality compared to current state-of-the-art baselines.

Item Type:
Journal Article
Journal or Publication Title:
IEEE Transactions on Visualization and Computer Graphics
Uncontrolled Keywords:
/dk/atira/pure/subjectarea/asjc/1700/1711
Subjects:
?? signal processingcomputer graphics and computer-aided designsoftwarecomputer vision and pattern recognition ??
ID Code:
227793
Deposited By:
Deposited On:
25 Feb 2025 15:50
Refereed?:
Yes
Published?:
Published
Last Modified:
13 Mar 2025 15:38