castorini/DeeBERT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/castorini/DeeBERT)

castorini / DeeBERT

DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference

☆161

Alternatives and similar repositories for DeeBERT

Users that are interested in DeeBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

autoliuweijie / FastBERT
View on GitHub
The score code of FastBERT (ACL2020)
☆606Oct 29, 2021Updated 4 years ago
castorini / berxit
View on GitHub
☆24Jan 18, 2021Updated 5 years ago
allenai / sledgehammer
View on GitHub
☆48Jun 8, 2020Updated 6 years ago
BitVoyage / FastBERT
View on GitHub
对ACL2020 FastBERT论文的复现，论文地址//arxiv.org/pdf/2004.02178.pdf
☆191Dec 15, 2021Updated 4 years ago
JetRunner / PABEE
View on GitHub
Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".
☆66Jun 19, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
biggsbenjamin / earlyexitnet
View on GitHub
Pytorch-based early exit network inspired by branchynet
☆36May 13, 2025Updated last year
txsun1997 / awesome-early-exiting
View on GitHub
A curated list of Early Exiting papers, benchmarks, and misc.
☆119Oct 26, 2023Updated 2 years ago
IBM / PoWER-BERT
View on GitHub
Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…
☆63Sep 17, 2025Updated 10 months ago
romebert / RomeBERT
View on GitHub
☆16May 6, 2021Updated 5 years ago
yzh119 / BPT
View on GitHub
Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"
☆127Apr 5, 2021Updated 5 years ago
JetRunner / BERT-of-Theseus
View on GitHub
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
☆316Jun 12, 2023Updated 3 years ago
studio-ousia / bpr
View on GitHub
Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering
☆175Jun 6, 2021Updated 5 years ago
TurkuNLP / wikibert
View on GitHub
BERT models for many languages created from Wikipedia texts
☆33May 25, 2020Updated 6 years ago
clovaai / length-adaptive-transformer
View on GitHub
Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)
☆102Nov 2, 2020Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
ofirpress / sandwich_transformer
View on GitHub
This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …
☆57Jan 1, 2021Updated 5 years ago
thunlp / TR-BERT
View on GitHub
Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"
☆49May 25, 2022Updated 4 years ago
neulab / RIPPLe
View on GitHub
Code for the paper "Weight Poisoning Attacks on Pre-trained Models" (ACL 2020)
☆142Sep 22, 2025Updated 10 months ago
voidful / NLPrep
View on GitHub
🍳 NLPrep - dataset tool for many natural language processing task
☆28Jul 30, 2021Updated 4 years ago
cdathuraliya / bert-inference
View on GitHub
An attempt to make Google BERT closer to production before Hugging Face Transformers etc.
☆28Sep 10, 2020Updated 5 years ago
facebookresearch / SentAugment
View on GitHub
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359Feb 22, 2022Updated 4 years ago
huawei-noah / Pretrained-Language-Model
View on GitHub
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
☆3,162Jan 22, 2024Updated 2 years ago
lxk00 / BERT-EMD
View on GitHub
☆50Jun 12, 2023Updated 3 years ago
kunglab / branchynet
View on GitHub
☆138Oct 3, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
neulab / cmu-multinlp
View on GitHub
Generalizing Natural Language Analysis through Span-relation Representations
☆91Sep 22, 2025Updated 10 months ago
facebookresearch / SpanBERT
View on GitHub
Code for using and evaluating SpanBERT.
☆908Jul 25, 2023Updated 2 years ago
laiguokun / Funnel-Transformer
View on GitHub
☆220Jun 8, 2020Updated 6 years ago
princeton-nlp / LM-BFF
View on GitHub
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
☆727Aug 29, 2022Updated 3 years ago
alexa / bort
View on GitHub
Repository for the paper "Optimal Subarchitecture Extraction for BERT"
☆470Jun 22, 2022Updated 4 years ago
yoavg / bert-syntax
View on GitHub
Assessing syntactic abilities of BERT
☆150May 23, 2019Updated 7 years ago
MC-BERT / MC-BERT
View on GitHub
☆99Jul 7, 2020Updated 6 years ago
saurabhkulkarni77 / DistillBERT
View on GitHub
☆61Nov 14, 2019Updated 6 years ago
Andrew-Tierno / QuantizedTransformer
View on GitHub
Implementation of a Quantized Transformer Model
☆20Mar 20, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
bytedance / effective_transformer
View on GitHub
Running BERT without Padding
☆479Mar 18, 2022Updated 4 years ago
airsplay / vokenization
View on GitHub
PyTorch code for EMNLP 2020 Paper "Vokenization: Improving Language Understanding with Visual Supervision"
☆191Mar 8, 2021Updated 5 years ago
yairf11 / MUPPET
View on GitHub
Code for the paper "multi-hop paragraph retrieval for open-domain question answering"
☆36Jun 21, 2022Updated 4 years ago
pmichel31415 / are-16-heads-really-better-than-1
View on GitHub
Code for the paper "Are Sixteen Heads Really Better than One?"
☆175Apr 1, 2020Updated 6 years ago
dreasysnail / CoCon
View on GitHub
Consistent dialogue generation
☆16Oct 26, 2022Updated 3 years ago
renatoviolin / BERT-cpp-inference
View on GitHub
☆52May 21, 2021Updated 5 years ago
google-research / electra
View on GitHub
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆2,367Mar 23, 2024Updated 2 years ago