Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf
☆1,726Sep 29, 2021Updated 4 years ago
Alternatives and similar repositories for TransCoder
Users that are interested in TransCoder are comparing it to the libraries listed below
Sorting:
- Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from d…☆771Mar 12, 2024Updated last year
- DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks☆1,267Mar 2, 2023Updated 2 years ago
- Pretrained Language Models for Source code☆253Jun 1, 2021Updated 4 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,159Sep 30, 2025Updated 5 months ago
- Datasets, tools, and benchmarks for representation learning of code.☆2,412Jan 31, 2022Updated 4 years ago
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic …☆3,636Feb 20, 2026Updated last week
- Contrastive Code Representation Learning: functionality-based JavaScript embeddings through self-supervised learning☆169Dec 26, 2021Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)☆5,618Jan 12, 2026Updated last month
- CodeXGLUE☆1,801Apr 23, 2024Updated last year
- CodeBERT☆2,737Jul 9, 2023Updated 2 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,490Jan 14, 2026Updated last month
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆30,860Updated this week
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆470Jun 22, 2022Updated 3 years ago
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆23,667Aug 15, 2024Updated last year
- NVIDIA's Deep Imagination Team's PyTorch Library☆4,076Nov 29, 2022Updated 3 years ago
- Google Research☆37,290Feb 20, 2026Updated last week
- Papers & presentation materials from Hugging Face's internal science day☆2,052Oct 31, 2020Updated 5 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,924Feb 14, 2023Updated 3 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.☆10,627Nov 3, 2023Updated 2 years ago
- An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.☆8,286Feb 25, 2022Updated 4 years ago
- FastFormers - highly efficient transformer models for NLU☆709Mar 21, 2025Updated 11 months ago
- This repository contains implementations and illustrative code to accompany DeepMind publications☆14,707Feb 20, 2026Updated last week
- CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.☆5,169Oct 27, 2025Updated 4 months ago
- This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX☆1,667Dec 21, 2025Updated 2 months ago
- A data augmentations library for audio, image, text, and video.☆5,071Feb 13, 2026Updated 2 weeks ago
- 🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools☆21,228Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆34,940Updated this week
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.☆59Jul 31, 2024Updated last year
- GPT-3: Language Models are Few-Shot Learners☆15,752Sep 18, 2020Updated 5 years ago
- Production infrastructure for machine learning at scale☆8,031Jun 12, 2024Updated last year
- An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)☆451Updated this week
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation☆3,097Jan 20, 2024Updated 2 years ago
- Code and data for ACL20 paper "Incorporating External Knowledge through Pre-training for Natural Language to Code Generation"☆97Sep 22, 2025Updated 5 months ago
- An open-source NLP research library, built on PyTorch.☆11,889Nov 22, 2022Updated 3 years ago
- Library for Knowledge Intensive Language Tasks☆965Mar 31, 2022Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries☆7,392Feb 3, 2026Updated 3 weeks ago
- Code for the paper "Language Models are Unsupervised Multitask Learners"☆24,635Aug 14, 2024Updated last year
- DeLighT: Very Deep and Light-Weight Transformers☆469Oct 16, 2020Updated 5 years ago