Hyperbolic Code Retrieval: A Novel Approach for Efficient Code Search Using Hyperbolic Space Embeddings

Tang, Xunzhu and Chen, zhenghan and Ezzini, Saad and Tian, Haoye and Song, Yewei and Klein, Jacques and Bissyande, Tegawende F. (2023) Hyperbolic Code Retrieval: A Novel Approach for Efficient Code Search Using Hyperbolic Space Embeddings. Other. UNSPECIFIED.

[thumbnail of pdf]
Other (pdf)
Download (0B)
[thumbnail of pdf]
Other (pdf)
2308.15234

Download (1MB)

Abstract

Within the realm of advanced code retrieval, existing methods have primarily relied on intricate matching and attention-based mechanisms. However, these methods often lead to computational and memory inefficiencies, posing a significant challenge to their real-world applicability. To tackle this challenge, we propose a novel approach, the Hyperbolic Code QA Matching (HyCoQA). This approach leverages the unique properties of Hyperbolic space to express connections between code fragments and their corresponding queries, thereby obviating the necessity for intricate interaction layers. The process commences with a reimagining of the code retrieval challenge, framed within a question-answering (QA) matching framework, constructing a dataset with triple matches characterized as \texttt{}. These matches are subsequently processed via a static BERT embedding layer, yielding initial embeddings. Thereafter, a hyperbolic embedder transforms these representations into hyperbolic space, calculating distances between the codes and descriptions. The process concludes by implementing a scoring layer on these distances and leveraging hinge loss for model training. Especially, the design of HyCoQA inherently facilitates self-organization, allowing for the automatic detection of embedded hierarchical patterns during the learning phase. Experimentally, HyCoQA showcases remarkable effectiveness in our evaluations: an average performance improvement of 3.5\% to 4\% compared to state-of-the-art code retrieval techniques.

Item Type:
Monograph (Other)
Subjects:
?? computer science - software engineering ??
ID Code:
212323
Deposited By:
Deposited On:
17 Jan 2024 14:40
Refereed?:
No
Published?:
Published
Last Modified:
03 Apr 2024 00:32