github / CodeSearchNetLinks
Datasets, tools, and benchmarks for representation learning of code.
☆2,378Updated 3 years ago
Alternatives and similar repositories for CodeSearchNet
Users that are interested in CodeSearchNet are comparing it to the libraries listed below
Sorting:
- CodeXGLUE☆1,757Updated last year
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code"☆1,141Updated 2 years ago
- CodeBERT☆2,652Updated 2 years ago
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code"☆563Updated 3 months ago
- Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from d…☆765Updated last year
- Conditional Transformer Language Model for Controllable Generation☆1,884Updated 5 months ago
- DeepCS: Deep Code Search☆283Updated 3 years ago
- This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.☆1,909Updated 9 months ago
- source{d} datasets ("big code") for source code analysis and machine learning on source code☆337Updated 5 years ago
- Code for the paper "Evaluating Large Language Models Trained on Code"☆2,973Updated 9 months ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,629Updated 2 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,438Updated 5 months ago
- jiant is an nlp toolkit☆1,668Updated 2 years ago
- Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf☆1,724Updated 4 years ago
- Papers & presentation materials from Hugging Face's internal science day☆2,050Updated 4 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,922Updated 2 years ago
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design…☆1,055Updated 4 years ago
- Language-Agnostic SEntence Representations☆3,648Updated last year
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆291Updated 8 months ago
- Guide to using pre-trained large language models of source code☆1,838Updated last year
- Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed.☆738Updated 2 years ago
- A lightning fast Finite State machine and REgular expression manipulation library.☆1,862Updated 10 months ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,363Updated last year
- This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX☆1,627Updated last month
- Dataset of GPT-2 outputs for research in detection, biases, and more☆1,998Updated last year
- The Natural Language Decathlon: A Multitask Challenge for NLP☆2,347Updated 5 months ago
- Shared repository for open-sourced projects from the Google AI Language team.☆1,712Updated last month
- ☆1,607Updated 2 years ago
- 🦄 State-of-the-Art Conversational AI with Transfer Learning☆1,748Updated 2 years ago
- The implementation of DeBERTa☆2,158Updated 2 years ago