Please wait a minute...
Frontiers of Computer Science

ISSN 2095-2228

ISSN 2095-2236(Online)

CN 10-1014/TP

Postal Subscription Code 80-970

2018 Impact Factor: 1.129

Front. Comput. Sci.    2024, Vol. 18 Issue (3) : 183209    https://doi.org/10.1007/s11704-023-2678-8
LETTER
Semantic similarity-based program retrieval: a multi-relational graph perspective
Qianwen GOU1, Yunwei DONG1(), YuJiao WU1, Qiao KE2
1. School of Computer Science, Northwestern Polytechnical University, Xi’an 710129, China
2. School of Mathematics and Statistics, Northwestern Polytechnical University, Xi’an 710129, China
 Download: PDF(1046 KB)   HTML
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Corresponding Author(s): Yunwei DONG   
About author:

Peng Lei and Charity Ngina Mwangi contributed equally to this work.

Just Accepted Date: 13 September 2023   Issue Date: 17 November 2023
 Cite this article:   
Qianwen GOU,Yunwei DONG,YuJiao WU, et al. Semantic similarity-based program retrieval: a multi-relational graph perspective[J]. Front. Comput. Sci., 2024, 18(3): 183209.
 URL:  
https://academic.hep.com.cn/fcs/EN/10.1007/s11704-023-2678-8
https://academic.hep.com.cn/fcs/EN/Y2024/V18/I3/183209
Fig.1  An overview of USRAE, including: 1) multi-relational graph construction; 2) multi-relational graph embedding; 3) semantic similarity calculation
Models SR@1 SR@5 SR@10 MRR
UNIF 0.435 0.652 0.756 0.493
DeepCS 0.528 0.692 0.771 0.541
CARLCS 0. 604 0.752 0.803 0.600
TABCS 0.651 0.821 0.835 0.669
DGMS 0.721 0.953 0.964 0.811
USRAE 0.813 0.963 0.982 0.877
Tab.1  Effectiveness of USRAE in terms of SR@k and MRR on JB-Java dataset (Best scores are in boldface)
Models SR@1 SR@5 SR@10 MRR
UNIF 0.416 0.645 0.747 0.472
DeepCS 0.471 0.653 0.753 0.502
CARLCS 0.585 0.693 0.752 0.573
TABCS 0.662 0.721 0.792 0.628
DGMS 0.778 0.819 0.897 0.662
USRAE 0.801 0.959 0.980 0.862
Tab.2  Effectiveness of USRAE in terms of SR@k and MRR on CSN-Python dataset (Best scores are in boldface)
1 Sadowski C, Stolee K T, Elbaum S. How developers search for code: a case study. In: Proceedings of the 10th Joint Meeting on Foundations of Software Engineering. 2015, 191–201
2 X, Ling L, Wu S, Wang G, Pan T, Ma F, Xu A X, Liu C, Wu S Ji . Deep graph matching and searching for semantic code retrieval. ACM Transactions on Knowledge Discovery from Data, 2021, 15( 5): 88
3 F, Yamaguchi N, Golde D, Arp K Rieck . Modeling and discovering vulnerabilities with code property graphs. In: Proceedings of 2014 IEEE Symposium on Security and Privacy. 2014, 590–604
4 L, Banarescu C, Bonial S, Cai M, Georgescu K, Griffitt U, Hermjakob K, Knight P, Koehn M, Palmer N Schneider . Abstract meaning representation for sembanking. In: Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse. 2013, 178–186
5 Feng Z, Guo D, Tang D, Duan N, Feng X, Gong M, Shou L, Qin B, Liu T, Jiang D, Zhou M. CodeBERT: a pre-trained model for programming and natural languages. In: Proceedings of Findings of the Association for Computational Linguistics: EMNLP 2020. 2020, 1536–1547
6 Cambronero J, Li H, Kim S, Sen K, Chandra S. When deep learning met code search. In: Proceedings of the 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 2019, 964−974
7 Gu X, Zhang H, Kim S. Deep code search. In: Proceedings of the 40th IEEE/ACM International Conference on Software Engineering. 2018, 933−944
8 J, Shuai L, Xu C, Liu M, Yan X, Xia Y Lei . Improving code search with co-attentive representation learning. In: Proceedings of the 28th International Conference on Program Comprehension. 2020, 196−207
9 L, Xu H, Yang C, Liu J, Shuai M, Yan Y, Lei Z Xu . Two-stage attention-based model for code search with textual and structural features. In: Proceedings of 2021 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER). 2021, 342−353
[1] FCS-22678-OF-QC_suppl_1 Download
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed