Models and datasets for annotated code search.
☆35May 22, 2023Updated 2 years ago
Alternatives and similar repositories for codesearch
Users that are interested in codesearch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Oct 22, 2020Updated 5 years ago
- a contextual search engine for software packages built on import2vec embeddings (https://www.code-compass.com)☆38Jan 14, 2026Updated 2 months ago
- Source Code for ACL-21 main conference paper "CoSQA: 20,000+ Web Queries for Code Search and Question Answering".☆47Nov 2, 2022Updated 3 years ago
- ☆19Dec 8, 2022Updated 3 years ago
- ☆23Mar 25, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A dataset for natural language code search.☆14Feb 13, 2020Updated 6 years ago
- ☆12Nov 14, 2021Updated 4 years ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆55Feb 24, 2022Updated 4 years ago
- StaQC: a systematically mined dataset containing around 148K Python and 120K SQL domain question-code pairs, as described in "StaQC: A Sy…☆172Aug 28, 2021Updated 4 years ago
- ☆11Jul 25, 2020Updated 5 years ago
- Improving Code Readability Classification using Convolutional Neural Networks☆10Apr 18, 2018Updated 7 years ago
- Web queries dataset for code search☆32Jun 3, 2023Updated 2 years ago
- Code for "Deep Graph Matching and Searching for Semantic Code Retrieval"☆24Oct 15, 2021Updated 4 years ago
- Neural bag of words code search implementation using PyTorch and data from the CodeSearchNet project.☆72Jan 6, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆44Jun 24, 2025Updated 9 months ago
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.☆17Feb 16, 2026Updated last month
- Code generation from natural language with less prior and more monolingual data☆13Aug 24, 2021Updated 4 years ago
- An audit tool for software projects based on the ISO/IEC 25010 specification☆10Jul 27, 2020Updated 5 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Aug 23, 2021Updated 4 years ago
- Mapping Language to Code in a Programmatic Context☆80Jan 27, 2021Updated 5 years ago
- source code for "Multi-Modal Attention Network Learning for Semantic Source Code Retrieval"☆20Jun 2, 2021Updated 4 years ago
- Contrastive Code Representation Learning: functionality-based JavaScript embeddings through self-supervised learning☆169Dec 26, 2021Updated 4 years ago
- A profiler for clojure STM☆23Mar 13, 2012Updated 14 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- TDCleaner: A Tool for Detecting Obsolete TODO Comments in Software Repos☆12Dec 9, 2021Updated 4 years ago
- DeepCS: Deep Code Search☆283May 26, 2022Updated 3 years ago
- The replication package of <Sentiment Analysis for Software Engineering: How Far Can Pre-trained Transformer Models Go?>. Accepted by IC…☆11Nov 29, 2023Updated 2 years ago
- Dataset and code for Findings of EMNLP'21 paper "CodeQA: A Question Answering Dataset for Source Code Comprehension".☆43Dec 23, 2023Updated 2 years ago
- This is the artifact for paper “Are Machine Learning Cloud APIs Used Correctly? (#421)” in ICSE2021☆16Feb 27, 2021Updated 5 years ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- FaCoY Code-to-Code Search Engine☆34Jan 18, 2019Updated 7 years ago
- A set of basic tools for manipulating SyGuS benchmarks☆25Sep 7, 2023Updated 2 years ago
- Code Generator☆23Feb 16, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for generating the JuICe dataset.☆37Oct 27, 2021Updated 4 years ago
- This repo is the implementation of the paper "GraphSearchNet: Enhancing GNNs via Capturing Global Dependency for Semantic Code Search". W…☆32Dec 31, 2022Updated 3 years ago
- This repo illustrates how to evaluate the artifacts in the paper An Extensive Study on Pre-trained Models for Program Understanding and G…☆27Aug 12, 2022Updated 3 years ago
- Experiments with Langchain using different approaches on Google colab☆25Mar 29, 2024Updated 2 years ago
- Fast tokenization and structural analysis of any programming language☆62Jan 14, 2025Updated last year
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W…☆90Apr 5, 2022Updated 4 years ago
- ☆15Jan 19, 2020Updated 6 years ago