HA-Transformer / MAT
The implementation of multi-branch attentive Transformer (MAT).
☆33 Updated 4 years ago
Alternatives and similar repositories for MAT:
Users who are interested in MAT are comparing it to the libraries listed below.
- Code for EMNLP 2020 paper CoDIR ☆41 Updated 2 years ago
- DisCo Transformer for Non-autoregressive MT ☆78 Updated 2 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever" ☆32 Updated 4 years ago
- ☆22 Updated 3 years ago
- Code for paper "Continual and Multi-Task Architecture Search (ACL 2019)" ☆41 Updated 5 years ago
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation" ☆22 Updated 3 years ago
- Language Model Baselines for PyTorch ☆42 Updated 4 years ago
- Code for "Counterfactual Variable Control for Robust and Interpretable Question Answering" ☆14 Updated 4 years ago
- ☆28 Updated 3 years ago
- ☆32 Updated 3 years ago
- Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018) ☆13 Updated 5 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆22 Updated 2 years ago
- A PyTorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB) ☆53 Updated 2 years ago
- Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation" ☆18 Updated 5 years ago
- [ACL'20] Highway Transformer: A Gated Transformer. ☆32 Updated 3 years ago
- ☆14 Updated 2 years ago
- Official code for the ICLR 2020 paper 'ARE PRE-TRAINED LANGUAGE MODELS AWARE OF PHRASES? SIMPLE BUT STRONG BASELINES FOR GRAMMAR INDUCTIO… ☆30 Updated last year
- ☆31 Updated 5 years ago
- Implementation of Soft-Label Chain Conditional Random Field for Phrase Grounding in PyTorch ☆16 Updated 2 years ago
- Code for "Understanding and Improving Layer Normalization" ☆46 Updated 5 years ago
- Text Content Manipulation ☆44 Updated 4 years ago
- 👾 A library of state-of-the-art pretrained models for Natural Language Processing (NLP) ☆9 Updated 5 years ago
- Code and dataset for "Transfer Learning Between Related Tasks Using Expected Label Proportions" ☆16 Updated 5 years ago
- Curriculum Learning related papers and materials ☆54 Updated 4 years ago
- ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation ☆25 Updated 4 years ago
- Code for "Understanding Neural Abstractive Summarization Models via Uncertainty" (EMNLP20) ☆30 Updated 4 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation" ☆10 Updated last year
- PyTorch implementation of Transformer-based Neural Machine Translation ☆77 Updated 2 years ago
- Code for paper "Interactive Machine Comprehension with Information Seeking Agents" -- public version ☆23 Updated 5 years ago
- Learn models that are robust to spurious correlations in the dataset. ☆26 Updated 5 years ago