JetRunner / BERT-of-Theseus
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
☆315Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for BERT-of-Theseus
Users who are interested in BERT-of-Theseus are comparing it to the libraries listed below.
- The source code of FastBERT (ACL 2020)☆609Oct 29, 2021Updated 4 years ago
- TensorFlow version of bert-of-theseus☆63Dec 11, 2020Updated 5 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,698May 8, 2023Updated 2 years ago
- bert-of-theseus via bert4keras☆31Jul 17, 2020Updated 5 years ago
- PyTorch implementation for Patient Knowledge Distillation for BERT Model Compression☆203Sep 20, 2019Updated 6 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,155Jan 22, 2024Updated 2 years ago
- DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference☆162Mar 25, 2022Updated 3 years ago
- Prune a model while finetuning or training.☆406Jun 21, 2022Updated 3 years ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,257Mar 7, 2024Updated last year
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".☆66Jun 19, 2021Updated 4 years ago
- TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)☆535May 19, 2021Updated 4 years ago
- BERT distillation (BERT-based distillation experiments)☆314Jul 30, 2020Updated 5 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,106May 9, 2024Updated last year
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,643Oct 16, 2024Updated last year
- Code for the paper "Are Sixteen Heads Really Better than One?"☆175Apr 1, 2020Updated 5 years ago
- ☆221Jun 8, 2020Updated 5 years ago
- Adversarial Training for Natural Language Understanding☆253Sep 6, 2023Updated 2 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆253Jan 7, 2024Updated 2 years ago
- ☆15Sep 10, 2019Updated 6 years ago
- Pre-Trained Chinese XLNet (Chinese XLNet pre-trained model)☆1,649Jul 15, 2025Updated 7 months ago
- a bert for retrieval and generation☆860Feb 26, 2021Updated 4 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,371Mar 23, 2024Updated last year
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,542Jul 18, 2025Updated 6 months ago
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization☆12Nov 23, 2021Updated 4 years ago
- RoBERTa Chinese pre-trained model: RoBERTa for Chinese☆2,773Jul 22, 2024Updated last year
- Knowledge Distillation from BERT☆54Jan 7, 2019Updated 7 years ago
- MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices☆71May 19, 2020Updated 5 years ago
- Open Language Pre-trained Model Zoo☆1,004Nov 18, 2021Updated 4 years ago
- BERT-related papers☆2,042Aug 12, 2023Updated 2 years ago
- Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…☆62Sep 17, 2025Updated 4 months ago
- CLUE baseline in PyTorch (PyTorch version of the CLUE baselines)☆75Apr 3, 2020Updated 5 years ago
- A reproduction of the ACL 2020 FastBERT paper; paper address: //arxiv.org/pdf/2004.02178.pdf☆193Dec 15, 2021Updated 4 years ago
- Hierarchically-Refined Label Attention Network for Sequence Labeling☆293Apr 9, 2021Updated 4 years ago
- A BERT-based Chinese Text Encoder Enhanced by N-gram Representations☆647Jul 24, 2022Updated 3 years ago
- Sequence tagging using BiLSTM-CRF, BERT, and other methods☆415Jun 12, 2023Updated 2 years ago
- Longformer: The Long-Document Transformer☆2,186Feb 8, 2023Updated 3 years ago
- DeLighT: Very Deep and Light-Weight Transformers☆469Oct 16, 2020Updated 5 years ago
- EasyTransfer is designed to make the development of transfer learning in NLP applications easier.☆863Aug 25, 2022Updated 3 years ago
- Code for using and evaluating SpanBERT.☆903Jul 25, 2023Updated 2 years ago