JetRunner / PABEE
Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".
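PABEE's core idea is patience-based early exit: an internal classifier is attached after each transformer layer, and inference stops as soon as the predicted label has stayed unchanged for `patience` consecutive layers. A minimal illustrative sketch of that control flow (the `layers`/`classifiers` callables are hypothetical stand-ins, not this repo's API):

```python
def patience_early_exit(hidden, layers, classifiers, patience=2):
    """Run layers one at a time, classify after each, and exit early
    once the prediction has been stable for `patience` consecutive steps.

    `layers` and `classifiers` are callables standing in for transformer
    blocks and per-layer prediction heads. Returns (prediction, exit_layer).
    """
    prev_pred, unchanged = None, 0
    for i, (layer, clf) in enumerate(zip(layers, classifiers), start=1):
        hidden = layer(hidden)
        pred = clf(hidden)                # predicted class label at layer i
        # count consecutive layers whose prediction did not change
        unchanged = unchanged + 1 if pred == prev_pred else 0
        prev_pred = pred
        if unchanged >= patience:         # prediction is stable: exit early
            return pred, i
    return prev_pred, len(layers)         # fell through: use the final layer
```

With `patience=2`, a model whose per-layer predictions are `0, 1, 1, 1, 2` would exit at layer 4 with label 1, skipping the remaining layer entirely.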
☆64 · Updated 3 years ago
Alternatives and similar repositories for PABEE:
Users interested in PABEE are comparing it to the repositories listed below.
- Method to improve inference time for BERT; an implementation of the paper "PoWER-BERT: Accelerating BERT Inference via Pro…" ☆59 · Updated last year
- PyTorch implementation of the paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021) ☆71 · Updated 3 years ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" (AAAI 2021) ☆51 · Updated 2 years ago
- BlackboxNLP (EMNLP 2020): "Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples" ☆23 · Updated 4 years ago
- ☆47 · Updated 4 years ago
- Code for the EMNLP 2020 paper CoDIR ☆41 · Updated 2 years ago
- ☆63 · Updated 2 years ago
- Source code for the NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference" ☆45 · Updated 2 years ago
- Source code for the Cutoff data augmentation approach proposed in the paper "A Simple but Tough-to-Beat Data Augmentation Approach …" ☆62 · Updated 4 years ago
- ☆116 · Updated 2 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining ☆118 · Updated last year
- ☆19 · Updated 4 years ago
- DEMix Layers for Modular Language Modeling ☆53 · Updated 3 years ago
- ☆41 · Updated 3 years ago
- Sequence-Level Mixed Sample Data Augmentation ☆21 · Updated 3 years ago
- Code for the paper "Are Sixteen Heads Really Better than One?" ☆171 · Updated 4 years ago
- ☆66 · Updated 3 years ago
- Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data ☆56 · Updated 3 years ago
- Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021) ☆17 · Updated 3 years ago
- ☆21 · Updated 5 years ago
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers" ☆17 · Updated 4 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022) ☆30 · Updated 3 years ago
- Code for the ACL 2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts" ☆45 · Updated 2 years ago
- Domain adaptation in NLP ☆52 · Updated 3 years ago
- Code associated with the ACL 2022 paper "SkipBERT: Efficient Inference with Shallow Layer Skipping" ☆16 · Updated 2 years ago
- A pre-trained model with a multi-exit transformer architecture ☆55 · Updated 2 years ago
- Code associated with the paper "Data Augmentation using Pre-trained Transformer Models" ☆52 · Updated last year
- Code for the ACL 2023 paper "Lifting the Curse of Capacity Gap in Distilling Language Models" ☆28 · Updated last year
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers" ☆57 · Updated last year
- Code for the paper "How many data points is a prompt worth?" ☆48 · Updated 3 years ago