A dataset for natural language code search.
☆14 · Feb 13, 2020 · Updated 6 years ago
Alternatives and similar repositories for CosBench
Users that are interested in CosBench are comparing it to the libraries listed below.
- ☆19 · Dec 8, 2022 · Updated 3 years ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies. ☆55 · Feb 24, 2022 · Updated 4 years ago
- Paper Artifacts for "Aroma: Code Recommendation via Structural Code Search" ☆60 · Sep 20, 2021 · Updated 4 years ago
- ☆23 · Mar 25, 2023 · Updated 3 years ago
- ☆44 · Jun 24, 2025 · Updated 10 months ago
- StaQC: a systematically mined dataset containing around 148K Python and 120K SQL domain question-code pairs, as described in "StaQC: A Sy… ☆172 · Aug 28, 2021 · Updated 4 years ago
- This is the artifact for paper “Are Machine Learning Cloud APIs Used Correctly? (#421)” in ICSE2021 ☆16 · Feb 27, 2021 · Updated 5 years ago
- TDCleaner: A Tool for Detecting Obsolete TODO Comments in Software Repos ☆12 · Dec 9, 2021 · Updated 4 years ago
- Probabilistic API Mining ☆53 · Jan 8, 2018 · Updated 8 years ago
- CopyNet (Copy Mechanism in Seq2Seq) implementation with TensorFlow 2 ☆10 · Nov 21, 2022 · Updated 3 years ago
- Code Snippet Recommendation from Stack Overflow Post ☆19 · Jun 30, 2021 · Updated 4 years ago
- Source code for EMSE 2023 paper "Zero-Shot Code Representation Learning via Prompt Tuning" ☆13 · Feb 15, 2023 · Updated 3 years ago
- ☆17 · May 23, 2023 · Updated 2 years ago
- A benchmark for evaluating embeddings of identifiers in source code. ☆22 · Aug 23, 2021 · Updated 4 years ago
- NaturalCC: An Open-Source Toolkit for Code Intelligence ☆315 · May 3, 2026 · Updated 2 weeks ago
- ☆12 · Oct 29, 2022 · Updated 3 years ago
- Repository for Deep API Learning (DeepAPI) ☆58 · Dec 3, 2021 · Updated 4 years ago
- Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs (EMNLP 2019) ☆19 · Dec 3, 2019 · Updated 6 years ago
- Here, we open source our measurement dataset and source code on IFTTT ☆11 · Oct 23, 2018 · Updated 7 years ago
- Source code for the paper "Neural Multi-Step Reasoning for Question Answering on Semi-Structured Tables" ☆20 · May 18, 2017 · Updated 9 years ago
- ☆15 · Jan 24, 2023 · Updated 3 years ago
- ☆16 · Jul 20, 2025 · Updated 10 months ago
- ☆10 · Apr 15, 2023 · Updated 3 years ago
- The dataset and source code for paper "API Method Recommendation without Worrying About the Task-API Knowledge Gap" ☆19 · Aug 20, 2018 · Updated 7 years ago
- Machine Learning based Source Code Clone validation tool. ☆15 · May 8, 2019 · Updated 7 years ago
- A collection of datasets for machine learning for big code ☆65 · Oct 8, 2021 · Updated 4 years ago
- Incremental Python parser for constrained generation of code by LLMs. ☆18 · Sep 18, 2024 · Updated last year
- Learning from what we know: How to perform vulnerability prediction using noisy historical data, Empirical Software Engineering (EMSE) ☆13 · Sep 20, 2023 · Updated 2 years ago
- Learning to Update Natural Language Comments Based on Code Changes: Artifact ☆33 · Oct 24, 2020 · Updated 5 years ago
- Automation Testing ☆10 · Apr 7, 2018 · Updated 8 years ago
- Code for paper "Lancer: Your Code Tell Me What You Need" ☆11 · Jun 17, 2022 · Updated 3 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024 ☆13 · Apr 21, 2024 · Updated 2 years ago
- The dataset and source code for CugLM ☆15 · Sep 1, 2020 · Updated 5 years ago
- SQL-to-Text is a simple code for translating SQL to Text Generation with a novel Graph-to-Sequence Model ☆74 · Nov 19, 2018 · Updated 7 years ago
- Web queries dataset for code search ☆32 · Jun 3, 2023 · Updated 2 years ago
- Semantic Code Search ☆37 · Feb 24, 2023 · Updated 3 years ago
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W… ☆90 · Apr 5, 2022 · Updated 4 years ago
- Preprocessed Python functions and docstrings for automated code documentation (code2doc) and automated code generation (doc2code) tasks. ☆211 · Jul 13, 2020 · Updated 5 years ago
- Generating Adversarial Examples for Holding Robustness of Source Code Processing Models ☆17 · Dec 2, 2021 · Updated 4 years ago