Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆22Jan 25, 2023Updated 3 years ago
Alternatives and similar repositories for fairseq
Users who are interested in fairseq are comparing it to the libraries listed below:
- Code for ACL2020 "Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation"☆39Jun 24, 2020Updated 5 years ago
- ☆12Jun 15, 2021Updated 4 years ago
- This repository contains example code to build models on TPUs☆30Feb 17, 2023Updated 3 years ago
- Implementation of the Optimal Completion Distillation for Sequence Labeling☆17Jul 25, 2024Updated last year
- Hard-Coded Gaussian Attention for Neural Machine Translation☆36May 22, 2023Updated 2 years ago
- Cross-Lingual Machine Reading Comprehension (EMNLP 2019)☆67Nov 6, 2019Updated 6 years ago
- Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"☆20Nov 12, 2021Updated 4 years ago
- Dilation Gate CNN For Machine Reading Comprehension☆17Mar 24, 2023Updated 2 years ago
- Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval☆43Jun 12, 2023Updated 2 years ago
- A list of advisory blogs and resources that I have found useful so far.☆22Nov 25, 2020Updated 5 years ago
- LM Pretraining with PyTorch/TPU☆137Oct 24, 2019Updated 6 years ago
- RP-GAN: Stable GAN Training with Random Projections☆22Jun 27, 2018Updated 7 years ago
- ☆21May 5, 2020Updated 5 years ago
- This repository contains code to replicate the no-longer publicly available Toronto BookCorpus dataset☆49Apr 6, 2022Updated 3 years ago
- Code for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"☆147Jun 10, 2019Updated 6 years ago
- ☆31Jun 28, 2022Updated 3 years ago
- A PyTorch Implementation of MelNet☆26Apr 13, 2020Updated 5 years ago
- ☆64Jul 17, 2020Updated 5 years ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…☆246Sep 17, 2021Updated 4 years ago
- Sparse Backpropagation for Mixture-of-Expert Training☆29Jul 2, 2024Updated last year
- Curated Lists for graph neural network, graph convolutional network, graph attention network, etc.☆27Apr 22, 2019Updated 6 years ago
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”☆31May 1, 2023Updated 2 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- ☆10Feb 2, 2021Updated 5 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago
- AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training☆129Aug 4, 2021Updated 4 years ago
- [Work in progress] A reading list for machine commonsense reasoning☆34Apr 14, 2020Updated 5 years ago
- ☆33Nov 7, 2019Updated 6 years ago
- TensorFlow implementation of Bi-directional RNN Language Model☆38Jul 28, 2018Updated 7 years ago
- The implementation of "Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation"☆39Aug 26, 2020Updated 5 years ago
- ☆12Feb 22, 2021Updated 5 years ago
- ☆36Aug 25, 2022Updated 3 years ago
- Code and performance tests to demonstrate the COUNTLESS algorithm. https://medium.com/@willsilversmith/countless-high-performance-2x-down…☆10Oct 23, 2019Updated 6 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆21Jan 6, 2026Updated last month
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Nov 7, 2021Updated 4 years ago
- Source code for our "MMM" paper at AAAI 2020☆40May 4, 2020Updated 5 years ago
- FaVIQ: Fact Verification from Information-seeking Questions☆43Nov 23, 2022Updated 3 years ago
- Language Model Fine-tuning for Moby Dick☆42Mar 3, 2019Updated 7 years ago