github / CodeSearchNetLinks
Datasets, tools, and benchmarks for representation learning of code.
☆2,374Updated 3 years ago
Alternatives and similar repositories for CodeSearchNet
Users that are interested in CodeSearchNet are comparing it to the libraries listed below
Sorting:
- CodeXGLUE☆1,747Updated last year
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code"☆1,144Updated 2 years ago
- CodeBERT☆2,630Updated 2 years ago
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code"☆562Updated 2 months ago
- Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from d…☆762Updated last year
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,425Updated 4 months ago
- jiant is an nlp toolkit☆1,670Updated 2 years ago
- DeepCS: Deep Code Search☆282Updated 3 years ago
- source{d} datasets ("big code") for source code analysis and machine learning on source code☆336Updated 5 years ago
- Conditional Transformer Language Model for Controllable Generation☆1,885Updated 4 months ago
- Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf☆1,722Updated 3 years ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆291Updated 7 months ago
- 🦄 State-of-the-Art Conversational AI with Transfer Learning☆1,748Updated 2 years ago
- Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"☆488Updated 2 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,923Updated 2 years ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,054Updated last year
- Code and model for the paper "Improving Language Understanding by Generative Pre-Training"☆2,242Updated 6 years ago
- Library to scrape and clean web pages to create massive datasets.☆2,208Updated 4 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,362Updated last year
- A library for mining of path-based representations of code (and more)☆295Updated last year
- This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.☆1,905Updated 9 months ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,180Updated 2 years ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,628Updated 2 years ago
- Papers & presentation materials from Hugging Face's internal science day☆2,050Updated 4 years ago
- LAnguage Model Analysis☆1,391Updated last year
- ALBERT: A Lite BERT for Self-supervised Learning of Language Representations☆3,274Updated 2 years ago
- End-to-end neural table-text understanding models.☆1,197Updated last year
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,151Updated last year
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,753Updated last year
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation☆3,061Updated last year