github / CodeSearchNet
Datasets, tools, and benchmarks for representation learning of code.
☆2,404 Updated 3 years ago
Alternatives and similar repositories for CodeSearchNet
Users who are interested in CodeSearchNet are comparing it to the libraries listed below.
- CodeXGLUE ☆1,796 Updated last year
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code" ☆1,140 Updated 2 years ago
- CodeBERT ☆2,717 Updated 2 years ago
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code" ☆564 Updated 5 months ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness" ☆292 Updated 11 months ago
- This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty. ☆1,925 Updated last year
- source{d} datasets ("big code") for source code analysis and machine learning on source code ☆343 Updated 6 years ago
- jiant is an nlp toolkit ☆1,674 Updated 2 years ago
- Conditional Transformer Language Model for Controllable Generation ☆1,884 Updated 8 months ago
- Papers & presentation materials from Hugging Face's internal science day ☆2,053 Updated 5 years ago
- Shared repository for open-sourced projects from the Google AI Language team. ☆1,733 Updated 3 weeks ago
- Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from d… ☆767 Updated last year
- ☆1,625 Updated 2 years ago
- Language-Agnostic SEntence Representations ☆3,660 Updated last year
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the… ☆2,070 Updated last year
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,922 Updated 2 years ago
- DeepCS: Deep Code Search ☆284 Updated 3 years ago
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic … ☆3,623 Updated last month
- A large annotated semantic parsing corpus for developing natural language interfaces. ☆1,795 Updated 3 months ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference" ☆1,629 Updated 2 years ago
- Trax — Deep Learning with Clear Code and Speed ☆8,298 Updated 3 months ago
- This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX ☆1,655 Updated 2 weeks ago
- Crawl BookCorpus ☆849 Updated 2 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://… ☆2,389 Updated 4 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer" ☆6,468 Updated 2 months ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,368 Updated last year
- Library to scrape and clean web pages to create massive datasets. ☆2,227 Updated 5 years ago
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design… ☆1,083 Updated 4 years ago
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models. ☆1,153 Updated last year
- A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering ☆726 Updated 2 months ago