github / CodeSearchNet
Datasets, tools, and benchmarks for representation learning of code.
☆2,388Updated 3 years ago
Alternatives and similar repositories for CodeSearchNet
Users who are interested in CodeSearchNet are comparing it to the libraries listed below.
- CodeXGLUE☆1,770Updated last year
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code"☆1,141Updated 2 years ago
- CodeBERT☆2,672Updated 2 years ago
- Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from d…☆766Updated last year
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code"☆564Updated 4 months ago
- source{d} datasets ("big code") for source code analysis and machine learning on source code☆338Updated 5 years ago
- jiant is an NLP toolkit☆1,671Updated 2 years ago
- This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX☆1,641Updated 2 months ago
- Conditional Transformer Language Model for Controllable Generation☆1,884Updated 6 months ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆292Updated 9 months ago
- ☆1,611Updated 2 years ago
- Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf☆1,722Updated 4 years ago
- Large datasets for conversational AI☆1,367Updated 6 years ago
- Dataset of GPT-2 outputs for research in detection, biases, and more☆2,001Updated last year
- Papers & presentation materials from Hugging Face's internal science day☆2,052Updated 5 years ago
- A library for mining of path-based representations of code (and more)☆298Updated last week
- DeepCS: Deep Code Search☆284Updated 3 years ago
- This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.☆1,916Updated 10 months ago
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation☆3,081Updated last year
- Code and model for the paper "Improving Language Understanding by Generative Pre-Training"☆2,257Updated 6 years ago
- Guide to using pre-trained large language models of source code☆1,843Updated last year
- End-to-end neural table-text understanding models.☆1,204Updated last year
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,043Updated last year
- Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.☆1,151Updated last year
- Preprocessed Python functions and docstrings for automated code documentation (code2doc) and automated code generation (doc2code) tasks.☆212Updated 5 years ago
- ☆1,643Updated 2 years ago
- GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural network.☆1,267Updated 6 years ago
- DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks☆1,267Updated 2 years ago
- A large annotated semantic parsing corpus for developing natural language interfaces.☆1,782Updated last month
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,753Updated last year