KAPE:
            <i>k</i>
            NN-Based Performance Testing for Deep Code Search

Yuejun Guo; Qiang Hu; Xiaofei Xie; Maxime Cordy; Mike Papadakis; Yves Le Traon

doi:10.1145/3624735

What is it about?

KAPE is an algorithm that can automatically estimate the performance of deep code search models on unseen data without manually matching code snippets to test queries. The main idea is to use semantically similar training data to perform the evaluation.

Why is it important?

Code search is a common yet important activity of software developers. An efficient code search model can largely facilitate the development process and improve the programming quality. Given the superb performance of learning the contextual representations, deep learning models, especially pre-trained language models, have been widely explored for the code search task. However, studies mainly focus on proposing new architectures for ever-better performance on designed test sets but ignore the performance on unseen test data where only natural language queries are available. The same problem in other domains, e.g., CV and NLP, is usually solved by test input selection that uses a subset of the unseen set to reduce the labeling effort. However, approaches from other domains are not directly applicable and still require labeling effort.

This page is a summary of: KAPE: k NN-Based Performance Testing for Deep Code Search, ACM Transactions on Software Engineering and Methodology, September 2023, ACM (Association for Computing Machinery),
DOI: 10.1145/3624735.
You can read the full text:

Read

Contributors

The following have contributed to this page

KAPE: Automatic performance estimation of deep code search

What is it about?

Why is it important?

Contributors

You might also like

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management

KAPE: Automatic performance estimation of deep code search

What is it about?

Featured Image

Why is it important?

Read the Original

Contributors

Share this page:

You might also like

Big data analytics in single‐cell transcriptomics: Five grand opportunities

Fast Parallel Hypertree Decompositions in Logarithmic Recursion Depth

On business adoption and use of reproducible builds for open and closed source software

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management