juntang-zhuang / fairseq-adabeliefLinks

☆9

Alternatives and similar repositories for fairseq-adabelief

Users that are interested in fairseq-adabelief are comparing it to the libraries listed below

Sorting:

intersun / CoDIR
Code for EMNLP 2020 paper CoDIR
☆41Updated 2 years ago
prajjwal1 / adaptive_transformer
Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)
☆43Updated 2 years ago
lucidrains / distilled-retriever-pytorch
Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"
☆32Updated 4 years ago
cyk1337 / Highway-Transformer
[ACL‘20] Highway Transformer: A Gated Transformer.
☆33Updated 3 years ago
lxk00 / BERT-EMD
☆50Updated 2 years ago
jxhe / sparse-text-prototype
PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"
☆22Updated 4 years ago
iedwardwangi / MetaAdapter
☆22Updated 4 years ago
lancopku / Prime
A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.
☆85Updated 2 years ago
microsoft / LiST
Lite Self-Training
☆29Updated 2 years ago
minghao-wu / CRF-AE
Code for EMNLP 2018 paper https://arxiv.org/pdf/1808.09075.pdf
☆38Updated 6 years ago
wenhuchen / GPT2-Logic2Text
The code for Template-GPT-2 Generation Model for Logic2Text Dataset
☆18Updated 5 years ago
lucidrains / coco-lm-pytorch
Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch
☆46Updated 4 years ago
vikas95 / AIR-retriever
AIR retriever for Multi-Hop QA (ACL 2020 paper)
☆30Updated 5 years ago
JetRunner / PABEE
Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".
☆65Updated 4 years ago
MurtyShikhar / ExpBERT
Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"
☆29Updated 5 years ago
10-zin / Synthesizer
A PyTorch implementation of the paper - "Synthesizer: Rethinking Self-Attention in Transformer Models"
☆73Updated 2 years ago
castorini / d-bert
Distilling BERT using natural language generation.
☆38Updated last year
allenai / unifew
Unifew: Unified Fewshot Learning Model
☆18Updated 3 years ago
Lingkai-Kong / Calibrated-BERT-Fine-Tuning
Code for Paper: Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
☆36Updated 4 years ago
nttcslab-nlp / doc_lm
☆12Updated 6 years ago
linzehui / Curriculum-Learning-PaperList-Materials
Curriculum Learning related papers and materials
☆54Updated 4 years ago
fuzihaofzh / repetition-problem-nlg
Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.
☆54Updated 2 years ago
cliang1453 / super-structured-lottery-tickets
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)
☆17Updated 4 years ago
belindal / TaskBench500
Suite of 500 procedurally-generated NLP tasks to study language model adaptability
☆21Updated 3 years ago
thunlp / TR-BERT
Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"
☆47Updated 3 years ago
salesforce / FactLM
☆11Updated last month
ShaojieJiang / tldr
Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"
☆10Updated 2 years ago
lifu-tu / ENGINE
ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation
☆25Updated 4 years ago
yuchenlin / CDMA-NER
☆28Updated 6 years ago
Alibaba-NLP / AIN
Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network"
☆19Updated 2 years ago