microsoft / fastseq
An efficient implementation of popular sequence models for text generation, summarization, and translation tasks. Paper: https://arxiv.org/pdf/2106.04718.pdf
☆431 · Updated 2 years ago
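FastSeq is positioned as a drop-in accelerator for existing generation pipelines. Below is a minimal sketch of how it is typically wired in, assuming the one-line integration its README describes (importing `fastseq` before loading a Hugging Face or fairseq model); the model name and generation parameters are illustrative choices, not taken from this page.

```python
# Minimal sketch (assumption: fastseq applies its optimizations simply by being
# imported before transformers/fairseq, as its README describes).
import fastseq  # must come first so generation gets patched
import torch
from transformers import BartForConditionalGeneration, BartTokenizer

# Illustrative model choice; any supported seq2seq model would work similarly.
tokenizer = BartTokenizer.from_pretrained("facebook/bart-large-cnn")
model = BartForConditionalGeneration.from_pretrained("facebook/bart-large-cnn")
model.eval()

article = "FastSeq provides efficient implementations of popular sequence models."
inputs = tokenizer([article], return_tensors="pt", truncation=True)

# Standard beam-search generation; fastseq aims to speed this call up
# without any API change on the caller's side.
with torch.no_grad():
    summary_ids = model.generate(inputs["input_ids"], num_beams=4, max_length=60)

print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```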
Related projects
Alternatives and complementary repositories for fastseq
- FastFormers - highly efficient transformer models for NLU ☆701 · Updated 10 months ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper ☆309 · Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆363 · Updated 2 years ago
- Fast BPE ☆656 · Updated 5 months ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty… ☆631 · Updated last year
- Repository for the paper "Optimal Subarchitecture Extraction for BERT" ☆470 · Updated 2 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o… ☆605 · Updated 2 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!) ☆325 · Updated 10 months ago
- MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf ☆288 · Updated 3 years ago
- ⛵️ The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020) ☆312 · Updated last year
- ⚡ Boost inference speed of T5 models by 5x & reduce the model size by 3x ☆565 · Updated last year
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework ☆257 · Updated 10 months ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆153 · Updated 11 months ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and pre-trained models for the paper "Parallel Iterative Edit Models … ☆228 · Updated last year
- Code to reproduce experiments in the paper "Task-Oriented Dialogue as Dataflow Synthesis" (TACL 2020) ☆308 · Updated 6 months ago
- Understanding the Difficulty of Training Transformers ☆328 · Updated 2 years ago
- Code associated with the "Don't Stop Pretraining" ACL 2020 paper ☆526 · Updated 3 years ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue ☆281 · Updated last year
- Interpretable Evaluation for AI Systems ☆361 · Updated last year
- A novel embedding training algorithm leveraging ANN search that achieves SOTA retrieval on TREC DL 2019 and OpenQA benchmarks ☆363 · Updated last year
- Code and data to support the paper "PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them" ☆202 · Updated 3 years ago
- Neural Text Generation with Unlikelihood Training ☆310 · Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks ☆193 · Updated 2 years ago
- A tool for holistic analysis of language generation systems ☆467 · Updated 2 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction ☆119 · Updated 3 years ago
- Efficient, check-pointed data loading for deep learning with massive data sets ☆205 · Updated last year
- Code and data for the ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model" ☆189 · Updated last year