DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
☆162Mar 25, 2022Updated 4 years ago
Alternatives and similar repositories for DeeBERT
Users that are interested in DeeBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The score code of FastBERT (ACL2020)☆609Oct 29, 2021Updated 4 years ago
- ☆24Jan 18, 2021Updated 5 years ago
- ☆48Jun 8, 2020Updated 5 years ago
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆192Dec 15, 2021Updated 4 years ago
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".☆66Jun 19, 2021Updated 4 years ago
- Pytorch-based early exit network inspired by branchynet☆36May 13, 2025Updated 10 months ago
- A curated list of Early Exiting papers, benchmarks, and misc.☆119Oct 26, 2023Updated 2 years ago
- Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…☆62Sep 17, 2025Updated 6 months ago
- ☆16May 6, 2021Updated 4 years ago
- Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"☆127Apr 5, 2021Updated 4 years ago
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆315Jun 12, 2023Updated 2 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆175Jun 6, 2021Updated 4 years ago
- 基于提前退出部分样本原理而实现的带分支网络(supported by chainer)☆45Apr 23, 2019Updated 6 years ago
- BERT models for many languages created from Wikipedia texts☆33May 25, 2020Updated 5 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Nov 2, 2020Updated 5 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆48May 25, 2022Updated 3 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆57Jan 1, 2021Updated 5 years ago
- Confident Adaptive Transformers☆14Apr 18, 2021Updated 4 years ago
- 🍳 NLPrep - dataset tool for many natural language processing task☆28Jul 30, 2021Updated 4 years ago
- Code for the paper "Weight Poisoning Attacks on Pre-trained Models" (ACL 2020)☆142Sep 22, 2025Updated 6 months ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- An attempt to make Google BERT closer to production before Hugging Face Transformers etc.☆28Sep 10, 2020Updated 5 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,155Jan 22, 2024Updated 2 years ago
- ☆50Jun 12, 2023Updated 2 years ago
- ☆135Oct 3, 2023Updated 2 years ago
- Generalizing Natural Language Analysis through Span-relation Representations☆91Sep 22, 2025Updated 6 months ago
- Code for using and evaluating SpanBERT.☆906Jul 25, 2023Updated 2 years ago
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723☆729Aug 29, 2022Updated 3 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆470Jun 22, 2022Updated 3 years ago
- Implementation of a Quantized Transformer Model☆19Mar 20, 2019Updated 7 years ago
- ☆99Jul 7, 2020Updated 5 years ago
- ☆61Nov 14, 2019Updated 6 years ago
- Running BERT without Padding☆479Mar 18, 2022Updated 4 years ago
- PyTorch code for EMNLP 2020 Paper "Vokenization: Improving Language Understanding with Visual Supervision"☆191Mar 8, 2021Updated 5 years ago
- Consistent dialogue generation☆16Oct 26, 2022Updated 3 years ago
- Code for the paper "multi-hop paragraph retrieval for open-domain question answering"☆36Jun 21, 2022Updated 3 years ago
- Code for the paper "Are Sixteen Heads Really Better than One?"☆175Apr 1, 2020Updated 5 years ago
- ☆52May 21, 2021Updated 4 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,370Mar 23, 2024Updated 2 years ago