计算机科学
一致性(知识库)
编码(集合论)
代码评审
预处理器
语义搜索
软件
情报检索
静态程序分析
数据挖掘
语义网
程序设计语言
软件开发
人工智能
集合(抽象数据类型)
作者
Xu Zhang,Z.-P. Lin,Xiaoyu Hu,J.Q. Wang,Wenpeng Lü,Deyu Zhou
摘要
Efficient code search techniques are crucial in accelerating software development by aiding developers in locating specific code snippets and understanding code functionalities. This study investigates code search methodologies, focusing on the emerging significance of semantic consistency in data augmentation techniques. While existing approaches predominantly enhance raw data, often requiring additional preprocessing and incurring higher training costs, this research introduces a pioneering method operating at the code and query representation levels. By bypassing the need for extensive data processing, this novel approach fosters an interactive alignment between code and query, augmenting the semantic coherence crucial for effective code search. An extensive empirical evaluation of a diverse dataset across multiple programming languages substantiates the efficacy of this approach in significantly enhancing code search model performance compared to traditional methodologies. The implementation is publicly available on GitHub 1 , offering an accessible resource for further exploration and application.
科研通智能强力驱动
Strongly Powered by AbleSci AI