lonePatient / MobileBert_PyTorch
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
☆68Updated 4 years ago
Alternatives and similar repositories for MobileBert_PyTorch:
Users that are interested in MobileBert_PyTorch are comparing it to the libraries listed below
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression☆202Updated 5 years ago
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆195Updated 2 years ago
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆312Updated last year
- R-Drop方法在中文任务上的简单实验☆91Updated 3 years ago
- DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference☆156Updated 3 years ago
- A PyTorch implementation of Transformer in "Attention is All You Need"☆105Updated 4 years ago
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆193Updated 3 years ago
- lightweighted deep learning inference service framework☆39Updated 3 years ago
- ☆251Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Updated 3 years ago
- Code for the paper "Are Sixteen Heads Really Better than One?"☆171Updated 5 years ago
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".☆65Updated 3 years ago
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).☆184Updated last year
- tensorflow version of bert-of-theseus☆62Updated 4 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆97Updated 2 years ago
- 离线端阅读理解应用 QA for mobile, Android & iPhone☆60Updated 2 years ago
- ☆52Updated 3 years ago
- Knowledge Distillation from BERT☆52Updated 6 years ago
- 论文模型复现☆42Updated 2 years ago
- A general framework for knowledge distillation☆54Updated 4 years ago
- 简洁易用版TinyBert:基于Bert进行知识蒸馏的预训练语言模型☆263Updated 4 years ago
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- Open Source Neural Machine Translation in PyTorch☆17Updated 6 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆32Updated last year
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆47Updated 2 years ago
- UDA(Unsupervised Data Augmentation) implemented by pytorch☆276Updated 5 years ago
- Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力…☆133Updated 4 years ago
- The repo contains the code of the ACL2020 paper `Dice Loss for Data-imbalanced NLP Tasks`☆275Updated last year
- ☆78Updated 2 years ago
- implement bert in pure c++☆36Updated 5 years ago