github / CodeSearchNetLinks
Datasets, tools, and benchmarks for representation learning of code.
☆2,411Updated 4 years ago
Alternatives and similar repositories for CodeSearchNet
Users that are interested in CodeSearchNet are comparing it to the libraries listed below
Sorting:
- CodeXGLUE☆1,799Updated last year
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code"☆1,141Updated 2 years ago
- CodeBERT☆2,726Updated 2 years ago
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code"☆565Updated 6 months ago
- Conditional Transformer Language Model for Controllable Generation☆1,885Updated 9 months ago
- This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.☆1,933Updated last year
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,488Updated 3 weeks ago
- source{d} datasets ("big code") for source code analysis and machine learning on source code☆343Updated 6 years ago
- Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from d…☆768Updated last year
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation☆3,099Updated 2 years ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆291Updated 11 months ago
- DeepCS: Deep Code Search☆283Updated 3 years ago
- Papers & presentation materials from Hugging Face's internal science day☆2,053Updated 5 years ago
- This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX☆1,660Updated last month
- A library for mining of path-based representations of code (and more)☆299Updated 2 months ago
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,155Updated last year
- A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering☆730Updated 3 months ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,390Updated 4 years ago
- jiant is an nlp toolkit☆1,675Updated 2 years ago
- Language-Agnostic SEntence Representations☆3,657Updated last year
- Large datasets for conversational AI☆1,383Updated 6 years ago
- Shared repository for open-sourced projects from the Google AI Language team.☆1,745Updated 2 weeks ago
- APPS: Automated Programming Progress Standard (NeurIPS 2021)☆501Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,371Updated last year
- Code for Defending Against Neural Fake News, https://rowanzellers.com/grover/☆919Updated 2 years ago
- Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf☆1,727Updated 4 years ago
- Dataset of GPT-2 outputs for research in detection, biases, and more☆2,014Updated 2 years ago
- Preprocessed Python functions and docstrings for automated code documentation (code2doc) and automated code generation (doc2code) tasks.☆211Updated 5 years ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].☆186Updated 3 years ago
- Code and model for the paper "Improving Language Understanding by Generative Pre-Training"☆2,271Updated 7 years ago