Models and datasets for annotated code search.
☆35May 22, 2023Updated 2 years ago
Alternatives and similar repositories for codesearch
Users that are interested in codesearch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Oct 22, 2020Updated 5 years ago
- a contextual search engine for software packages built on import2vec embeddings (https://www.code-compass.com)☆38Jan 14, 2026Updated 3 months ago
- Source Code for ACL-21 main conference paper "CoSQA: 20,000+ Web Queries for Code Search and Question Answering".☆48Nov 2, 2022Updated 3 years ago
- ☆19Dec 8, 2022Updated 3 years ago
- ☆23Mar 25, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A dataset for natural language code search.☆14Feb 13, 2020Updated 6 years ago
- ☆12Nov 14, 2021Updated 4 years ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆55Feb 24, 2022Updated 4 years ago
- StaQC: a systematically mined dataset containing around 148K Python and 120K SQL domain question-code pairs, as described in "StaQC: A Sy…☆172Aug 28, 2021Updated 4 years ago
- ☆11Jul 25, 2020Updated 5 years ago
- evaluation dataset consisting of natural language query and code snippet pairs☆124May 3, 2024Updated last year
- Web queries dataset for code search☆32Jun 3, 2023Updated 2 years ago
- Neural bag of words code search implementation using PyTorch and data from the CodeSearchNet project.☆72Jan 6, 2023Updated 3 years ago
- ☆44Jun 24, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code search model based the self-attention☆12Oct 16, 2020Updated 5 years ago
- An audit tool for software projects based on the ISO/IEC 25010 specification☆10Jul 27, 2020Updated 5 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Aug 23, 2021Updated 4 years ago
- Mapping Language to Code in a Programmatic Context☆80Jan 27, 2021Updated 5 years ago
- Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“☆65Apr 18, 2022Updated 4 years ago
- Contrastive Code Representation Learning: functionality-based JavaScript embeddings through self-supervised learning☆169Dec 26, 2021Updated 4 years ago
- Code and data for ACL20 paper "Incorporating External Knowledge through Pre-training for Natural Language to Code Generation"☆97Sep 22, 2025Updated 7 months ago
- TDCleaner: A Tool for Detecting Obsolete TODO Comments in Software Repos☆12Dec 9, 2021Updated 4 years ago
- DeepCS: Deep Code Search☆284May 26, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Dataset and code for Findings of EMNLP'21 paper "CodeQA: A Question Answering Dataset for Source Code Comprehension".☆43Dec 23, 2023Updated 2 years ago
- This is the artifact for paper “Are Machine Learning Cloud APIs Used Correctly? (#421)” in ICSE2021☆16Feb 27, 2021Updated 5 years ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- A set of basic tools for manipulating SyGuS benchmarks☆25Sep 7, 2023Updated 2 years ago
- ☆21Oct 6, 2021Updated 4 years ago
- Code Generator☆23Feb 16, 2023Updated 3 years ago
- Code for generating the JuICe dataset.☆37Oct 27, 2021Updated 4 years ago
- Building Training Datasets for Deep Learning Models in Software Engineering and Empirical Software Engineering Research☆26Jun 26, 2024Updated last year
- This repo illustrates how to evaluate the artifacts in the paper An Extensive Study on Pre-trained Models for Program Understanding and G…☆27Aug 12, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Fast tokenization and structural analysis of any programming language☆62Jan 14, 2025Updated last year
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W…☆90Apr 5, 2022Updated 4 years ago
- ☆15Jan 19, 2020Updated 6 years ago
- ☆43Jan 1, 2025Updated last year
- Recent Advances in Programming Language Pre-Trained Models (PL-PTMs)☆60Dec 17, 2021Updated 4 years ago
- Mining tool and large-scale datasets of single statement bug fixes in Python☆19Nov 29, 2023Updated 2 years ago
- ☆12Dec 29, 2022Updated 3 years ago