⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
☆315Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for BERT-of-Theseus
Users that are interested in BERT-of-Theseus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- tensorflow version of bert-of-theseus☆63Dec 11, 2020Updated 5 years ago
- The score code of FastBERT (ACL2020)☆608Oct 29, 2021Updated 4 years ago
- bert-of-theseus via bert4keras☆31Jul 17, 2020Updated 5 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,696May 8, 2023Updated 2 years ago
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression☆204Sep 20, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,157Jan 22, 2024Updated 2 years ago
- ☆15Sep 10, 2019Updated 6 years ago
- BERT distillation(基于BERT的蒸馏实验 )☆316Jul 30, 2020Updated 5 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,104May 9, 2024Updated last year
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,257Mar 7, 2024Updated 2 years ago
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".☆66Jun 19, 2021Updated 4 years ago
- Code for the paper "Are Sixteen Heads Really Better than One?"☆175Apr 1, 2020Updated 6 years ago
- Prune a model while finetuning or training.☆406Jun 21, 2022Updated 3 years ago
- DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference☆161Mar 25, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices☆71May 19, 2020Updated 5 years ago
- ☆220Jun 8, 2020Updated 5 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,649Oct 16, 2024Updated last year
- TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)☆535May 19, 2021Updated 4 years ago
- Knowledge Distillation from BERT☆54Jan 7, 2019Updated 7 years ago
- Adversarial Training for Natural Language Understanding☆253Sep 6, 2023Updated 2 years ago
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization☆12Nov 23, 2021Updated 4 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆253Jan 7, 2024Updated 2 years ago
- Pre-Trained Chinese XLNet(中文XLNet预训练模型)☆1,649Jul 15, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆192Dec 15, 2021Updated 4 years ago
- a bert for retrieval and generation☆860Feb 26, 2021Updated 5 years ago
- Open Language Pre-trained Model Zoo☆1,006Nov 18, 2021Updated 4 years ago
- using bilstm-crf,bert and other methods to do sequence tagging task☆415Jun 12, 2023Updated 2 years ago
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,783Jul 22, 2024Updated last year
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,545Jul 18, 2025Updated 9 months ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,372Mar 23, 2024Updated 2 years ago
- An Implementation of Bidirectional Attention Flow☆39Sep 6, 2017Updated 8 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆48May 25, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for the ACL2021 paper "Lexicon Enhanced Chinese Sequence Labelling Using BERT Adapter"☆345Jan 15, 2022Updated 4 years ago
- transformers implement (architecture, task example, serving and more)☆96Mar 23, 2022Updated 4 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,984Nov 21, 2022Updated 3 years ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,439Jul 15, 2025Updated 9 months ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,627Jun 12, 2023Updated 2 years ago
- BERT-related papers☆2,039Aug 12, 2023Updated 2 years ago
- ☆604Mar 12, 2026Updated last month