Models and datasets for annotated code search.
☆35May 22, 2023Updated 2 years ago
Alternatives and similar repositories for codesearch
Users that are interested in codesearch are comparing it to the libraries listed below
- a contextual search engine for software packages built on import2vec embeddings (https://www.code-compass.com)☆38Jan 14, 2026Updated 2 months ago
- A dataset for natural language code search.☆14Feb 13, 2020Updated 6 years ago
- A large dataset of 4.2M Java source code samples with parallel descriptions, drawn from code search and code summarization studies.☆55Feb 24, 2022Updated 4 years ago
- StaQC: a systematically mined dataset containing around 148K Python and 120K SQL domain question-code pairs, as described in "StaQC: A Sy…☆172Aug 28, 2021Updated 4 years ago
- evaluation dataset consisting of natural language query and code snippet pairs☆124May 3, 2024Updated last year
- Improving Code Readability Classification using Convolutional Neural Networks☆10Apr 18, 2018Updated 7 years ago
- Web queries dataset for code search☆32Jun 3, 2023Updated 2 years ago
- Code for "Deep Graph Matching and Searching for Semantic Code Retrieval"☆24Oct 15, 2021Updated 4 years ago
- Neural bag of words code search implementation using PyTorch and data from the CodeSearchNet project.☆72Jan 6, 2023Updated 3 years ago
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.☆17Feb 16, 2026Updated last month
- NLP2API: Query Reformulation for Code Search using Crowdsourced Knowledge and Extra-Large Data Analytics.☆12Dec 31, 2020Updated 5 years ago
- Code search model based on self-attention☆12Oct 16, 2020Updated 5 years ago
- Code generation from natural language with less prior and more monolingual data☆13Aug 24, 2021Updated 4 years ago
- An audit tool for software projects based on the ISO/IEC 25010 specification☆10Jul 27, 2020Updated 5 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Aug 23, 2021Updated 4 years ago
- Seamless Synchronization of Distributed Web Clients☆15Nov 24, 2021Updated 4 years ago
- Mapping Language to Code in a Programmatic Context☆80Jan 27, 2021Updated 5 years ago
- Source code for the paper “ReACC: A Retrieval-Augmented Code Completion Framework”☆65Apr 18, 2022Updated 3 years ago
- source code for "Multi-Modal Attention Network Learning for Semantic Source Code Retrieval"☆20Jun 2, 2021Updated 4 years ago
- Contrastive Code Representation Learning: functionality-based JavaScript embeddings through self-supervised learning☆169Dec 26, 2021Updated 4 years ago
- A profiler for clojure STM☆23Mar 13, 2012Updated 14 years ago
- Code and data for ACL20 paper "Incorporating External Knowledge through Pre-training for Natural Language to Code Generation"☆97Sep 22, 2025Updated 5 months ago
- TDCleaner: A Tool for Detecting Obsolete TODO Comments in Software Repos☆12Dec 9, 2021Updated 4 years ago
- DeepCS: Deep Code Search☆283May 26, 2022Updated 3 years ago
- The replication package of <Sentiment Analysis for Software Engineering: How Far Can Pre-trained Transformer Models Go?>. Accepted by IC…☆11Nov 29, 2023Updated 2 years ago
- Dataset and code for Findings of EMNLP'21 paper "CodeQA: A Question Answering Dataset for Source Code Comprehension".☆43Dec 23, 2023Updated 2 years ago
- This is the artifact for paper “Are Machine Learning Cloud APIs Used Correctly? (#421)” in ICSE2021☆16Feb 27, 2021Updated 5 years ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- FaCoY Code-to-Code Search Engine☆34Jan 18, 2019Updated 7 years ago
- Code Generator☆23Feb 16, 2023Updated 3 years ago
- Code for generating the JuICe dataset.☆37Oct 27, 2021Updated 4 years ago
- Building Training Datasets for Deep Learning Models in Software Engineering and Empirical Software Engineering Research☆26Jun 26, 2024Updated last year
- This repo is the implementation of the paper "GraphSearchNet: Enhancing GNNs via Capturing Global Dependency for Semantic Code Search". W…☆32Dec 31, 2022Updated 3 years ago
- This repo is a benchmark for source code summarization in the C language☆26Mar 18, 2021Updated 5 years ago