alexa / bortLinks

Repository for the paper "Optimal Subarchitecture Extraction for BERT"

☆473

Alternatives and similar repositories for bort

Users that are interested in bort are comparing it to the libraries listed below

Sorting:

microsoft / fastformers
FastFormers - highly efficient transformer models for NLU
☆705Updated 4 months ago
microsoft / fastseq
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…
☆433Updated 2 years ago
facebookresearch / SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆361Updated 3 years ago
google-research / lasertagger
☆604Updated last month
google-research / xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…
☆646Updated 2 years ago
graykode / ALBERT-Pytorch
Pytorch Implementation of ALBERT(A Lite BERT for Self-supervised Learning of Language Representations)
☆226Updated 4 years ago
richarddwang / electra_pytorch
Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)
☆330Updated last year
kwonmha / bert-vocab-builder
Builds wordpiece(subword) vocabulary compatible for Google Research's BERT
☆229Updated 4 years ago
JetRunner / BERT-of-Theseus
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
☆313Updated 2 years ago
facebookresearch / XNLI
Evaluating Cross-lingual Sentence Representations
☆456Updated 3 years ago
microsoft / MPNet
MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf
☆294Updated 3 years ago
facebookresearch / SpanBERT
Code for using and evaluating SpanBERT.
☆899Updated 2 years ago
yitu-opensource / ConvBert
☆251Updated 2 years ago
google-research / byt5
☆514Updated last year
glample / fastBPE
Fast BPE
☆670Updated last year
nyu-dl / bert-gen
☆323Updated 2 years ago
asyml / texar-pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CAS…
☆746Updated 3 years ago
allenai / dont-stop-pretraining
Code associated with the Don't Stop Pretraining ACL 2020 paper
☆532Updated 3 years ago
bhoov / exbert
A Visual Analysis Tool to Explore Learned Representations in Transformers Models
☆597Updated last year
LiyuanLucasLiu / Transformer-Clinic
Understanding the Difficulty of Training Transformers
☆329Updated 3 years ago
abelriboulot / onnxt5
Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
☆255Updated 2 years ago
roomylee / nlp-papers-with-arxiv
Statistics and accepted paper list of NLP conferences with arXiv link
☆431Updated 4 years ago
graykode / xlnet-Pytorch
Simple XLNet implementation with Pytorch Wrapper
☆580Updated 6 years ago
princeton-nlp / DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…
☆604Updated 3 years ago
kamalkraj / ALBERT-TF2.0
ALBERT model Pretraining and Fine Tuning using TF2.0
☆202Updated 2 years ago
google-research-datasets / paws
This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…
☆559Updated 3 years ago
laiguokun / Funnel-Transformer
☆218Updated 5 years ago
renatoviolin / xlnet
XLNet: fine tuning on RTX 2080 GPU - 8 GB
☆154Updated 6 years ago
facebookresearch / unlikelihood_training
Neural Text Generation with Unlikelihood Training
☆309Updated 3 years ago
mtreviso / linear-chain-crf
Implementation of a linear-chain CRF in PyTorch
☆97Updated 4 years ago