A dataset for natural language code search.
☆14Feb 13, 2020Updated 6 years ago
Alternatives and similar repositories for CosBench
Users that are interested in CosBench are comparing it to the libraries listed below
Sorting:
- evaluation dataset consisting of natural language query and code snippet pairs☆124May 3, 2024Updated last year
- Code and plugin for paper "Automated Query Reformulation for Efficient Search based on Query Logs From Stack Overflow“☆16Nov 19, 2022Updated 3 years ago
- ☆19Dec 8, 2022Updated 3 years ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆55Feb 24, 2022Updated 4 years ago
- Source Code for ACL-21 main conference paper "CoSQA: 20,000+ Web Queries for Code Search and Question Answering".☆46Nov 2, 2022Updated 3 years ago
- A python library to build graphs for programs written in different programming languages.☆13May 6, 2022Updated 3 years ago
- Paper Artifacts for "Aroma: Code Recommendation via Structural Code Search"☆59Sep 20, 2021Updated 4 years ago
- ☆22Dec 26, 2020Updated 5 years ago
- ☆44Jun 24, 2025Updated 8 months ago
- StaQC: a systematically mined dataset containing around 148K Python and 120K SQL domain question-code pairs, as described in "StaQC: A Sy…☆172Aug 28, 2021Updated 4 years ago
- DeepCS: Deep Code Search☆283May 26, 2022Updated 3 years ago
- The guide of paper writing in Latex☆19Feb 7, 2025Updated last year
- This is the artifact for paper “Are Machine Learning Cloud APIs Used Correctly? (#421)” in ICSE2021☆16Feb 27, 2021Updated 5 years ago
- TDCleaner: A Tool for Detecting Obsolete TODO Comments in Software Repos☆12Dec 9, 2021Updated 4 years ago
- Probabilistic API Mining☆53Jan 8, 2018Updated 8 years ago
- CopyNet (Copy Mechanism in Seq2Seq) implementation with TensorFlow 2☆10Nov 21, 2022Updated 3 years ago
- Code Snippet Recommendation from Stack Overflow Post☆19Jun 30, 2021Updated 4 years ago
- Source code for EMSE 2023 paper "Zero-Shot Code Representation Learning via Prompt Tuning"☆13Feb 15, 2023Updated 3 years ago
- ☆16May 23, 2023Updated 2 years ago
- Deep Reinforcement Learning-Based Method for Capacity Vehicle Route Problem with Time Windows☆14Jul 28, 2024Updated last year
- NaturalCC: An Open-Source Toolkit for Code Intelligence☆315Feb 6, 2026Updated last month
- ☆12Oct 29, 2022Updated 3 years ago
- Repository for Deep API Learning (DeepAPI)☆56Dec 3, 2021Updated 4 years ago
- NLQF is a tool to filter query-appropriate comments for building high-quality code search datasets.☆19Feb 15, 2022Updated 4 years ago
- ☆75Oct 26, 2020Updated 5 years ago
- ☆11Jul 25, 2020Updated 5 years ago
- Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs(EMNLP2019)☆19Dec 3, 2019Updated 6 years ago
- Code for EMNLP'21 paper "Types of Out-of-Distribution Texts and How to Detect Them"☆25Nov 13, 2021Updated 4 years ago
- ☆15Jan 24, 2023Updated 3 years ago
- ☆10Apr 15, 2023Updated 2 years ago
- A collection of datasets for machine learning for big code☆62Oct 8, 2021Updated 4 years ago
- Incremental Python parser for constrained generation of code by LLMs.☆18Sep 18, 2024Updated last year
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆15May 18, 2022Updated 3 years ago
- ☆14Feb 2, 2023Updated 3 years ago
- ☆10Feb 1, 2023Updated 3 years ago
- Learning to Update Natural Language Comments Based on Code Changes: Artifact☆33Oct 24, 2020Updated 5 years ago
- Code for paper "Lancer: Your Code Tell Me What You Need"☆11Jun 17, 2022Updated 3 years ago
- The dataset and source code for CugLM☆15Sep 1, 2020Updated 5 years ago
- Code for EMNLP2019 paper "Low-Resource Response Generation with Template Prior"☆13Jan 17, 2020Updated 6 years ago