lonePatient / MobileBert_PyTorch
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
☆67Updated 4 years ago
Alternatives and similar repositories for MobileBert_PyTorch:
Users who are interested in MobileBert_PyTorch are comparing it to the libraries listed below
- PyTorch implementation of Patient Knowledge Distillation for BERT Model Compression☆201Updated 5 years ago
- A simple experiment applying the R-Drop method to Chinese tasks☆90Updated 3 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Updated 3 years ago
- A reproduction of the ACL 2020 FastBERT paper; paper link: //arxiv.org/pdf/2004.02178.pdf☆193Updated 3 years ago
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆193Updated last year
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆309Updated last year
- ☆251Updated 2 years ago
- Finetune CPM-1☆74Updated last year
- Chinese version of reformer-pytorch: a simple, efficient generative model with results similar to GPT2☆16Updated last year
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).☆184Updated last year
- Lightweight deep learning inference service framework☆39Updated 3 years ago
- PyTorch implementations of algorithms for knowledge distillation.☆57Updated 4 years ago
- adafactor optimizer for keras☆20Updated 3 years ago
- An Albert Large QA model trained on the Baidu webqa and dureader datasets☆75Updated 4 years ago
- Annotated BERT code showing the input and output of every step, suitable for beginners starting from scratch☆93Updated 2 years ago
- tensorflow version of bert-of-theseus☆62Updated 4 years ago
- A PyTorch implementation of the paper - "Synthesizer: Rethinking Self-Attention in Transformer Models"☆72Updated 2 years ago
- lasertagger-chinese: a Chinese-language lasertagger learning example, with sample data, annotations, and shell run scripts☆75Updated last year
- Knowledge Distillation from BERT☆52Updated 6 years ago
- Implementation of RealFormer using pytorch☆101Updated 4 years ago
- CTC2021: the SOTA solution and online demo for the Chinese text correction competition☆72Updated last year
- ☆166Updated 3 years ago
- A PyTorch implementation of Transformer in "Attention is All You Need"☆103Updated 4 years ago
- Sharing some problems encountered when applying S2S in practice, and their solutions.☆27Updated 4 years ago
- A PyTorch-based toolkit for natural language processing☆155Updated last year
- A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"☆56Updated 4 years ago
- Chinese MobileBERT model☆88Updated 3 years ago
- Transformer model based on the Gated Attention Unit (early preview version)☆97Updated 2 years ago
- ☆23Updated 4 years ago