github / CodeSearchNet
Datasets, tools, and benchmarks for representation learning of code.
☆2,281 Updated 3 years ago
Alternatives and similar repositories for CodeSearchNet:
Users who are interested in CodeSearchNet are comparing it to the repositories listed below.
- CodeXGLUE ☆1,642 Updated 11 months ago
- CodeBERT ☆2,447 Updated last year
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code" ☆1,127 Updated last year
- Language-Agnostic SEntence Representations ☆3,629 Updated 10 months ago
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code" ☆558 Updated 7 months ago
- Conditional Transformer Language Model for Controllable Generation ☆1,879 Updated 3 years ago
- Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models. ☆1,142 Updated last year
- DeepCS: Deep Code Search ☆280 Updated 2 years ago
- jiant is an NLP toolkit ☆1,664 Updated last year
- A lightning fast Finite State machine and REgular expression manipulation library. ☆1,837 Updated 3 months ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,904 Updated 2 years ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve… ☆4,164 Updated 10 months ago
- This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty. ☆1,864 Updated 3 months ago
- NLP made easy ☆2,559 Updated last year
- A large annotated semantic parsing corpus for developing natural language interfaces. ☆1,711 Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,352 Updated last year
- Library to scrape and clean web pages to create massive datasets. ☆2,182 Updated 4 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding ☆6,183 Updated last year
- A natural language modeling framework based on PyTorch ☆6,324 Updated 2 years ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages ☆7,411 Updated last week
- ✨Fast Coreference Resolution in spaCy with Neural Networks ☆2,869 Updated last year
- An open-source NLP research library, built on PyTorch. ☆11,834 Updated 2 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://… ☆2,388 Updated 3 years ago
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation ☆2,943 Updated last year
- The implementation of DeBERTa ☆2,064 Updated last year
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"