microsoft / fastformersLinks

FastFormers - highly efficient transformer models for NLU

☆706

Alternatives and similar repositories for fastformers

Users that are interested in fastformers are comparing it to the libraries listed below

Sorting:

alexa / bort
Repository for the paper "Optimal Subarchitecture Extraction for BERT"
☆471Updated 3 years ago
microsoft / fastseq
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…
☆432Updated 3 years ago
google-research / xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…
☆648Updated 2 years ago
Lightning-Universe / lightning-transformers
Flexible components pairing 🤗 Transformers with Pytorch Lightning
☆612Updated 2 years ago
bhoov / exbert
A Visual Analysis Tool to Explore Learned Representations in Transformers Models
☆602Updated last year
richarddwang / electra_pytorch
Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)
☆330Updated last year
facebookresearch / SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆362Updated 3 years ago
glample / fastBPE
Fast BPE
☆678Updated last year
google-research / byt5
☆524Updated last year
huggingface / pytorch_block_sparse
Fast Block Sparse Matrices for Pytorch
☆547Updated 4 years ago
huggingface / nn_pruning
Prune a model while finetuning or training.
☆405Updated 3 years ago
tunib-ai / parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
☆790Updated 2 years ago
asyml / texar-pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CAS…
☆747Updated 3 years ago
Ki6an / fastT5
⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
☆587Updated 2 years ago
LiyuanLucasLiu / Transformer-Clinic
Understanding the Difficulty of Training Transformers
☆330Updated 3 years ago
facebookresearch / bitsandbytes
Library for 8-bit optimizers and quantization routines.
☆780Updated 3 years ago
facebookresearch / vizseq
An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)
☆446Updated 4 months ago
IntelLabs / academic-budget-bert
Repository containing code for "How to Train BERT with an Academic Budget" paper
☆315Updated 2 years ago
facebookresearch / adaptive-span
Transformer training code for sequential tasks
☆610Updated 4 years ago
google-research / bigbird
Transformers for Longer Sequences
☆620Updated 3 years ago
harvardnlp / pytorch-struct
Fast, general, and tested differentiable structured prediction in PyTorch
☆1,117Updated 3 years ago
abelriboulot / onnxt5
Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
☆256Updated 2 years ago
nyu-mll / jiant
jiant is an nlp toolkit
☆1,670Updated 2 years ago
facebookresearch / SpanBERT
Code for using and evaluating SpanBERT.
☆899Updated 2 years ago
facebookresearch / XNLI
Evaluating Cross-lingual Sentence Representations
☆458Updated 4 years ago
google-research-datasets / paws
This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…
☆561Updated 3 years ago
GEM-benchmark / NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
☆788Updated last year
microsoft / MPNet
MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf
☆292Updated 4 years ago
google-research / multilingual-t5
☆1,286Updated 2 years ago
sacmehta / delight
DeLighT: Very Deep and Light-Weight Transformers
☆468Updated 5 years ago