Knowledge Distillation from BERT
☆54Jan 7, 2019Updated 7 years ago
Alternatives and similar repositories for distill-bert
Users that are interested in distill-bert are comparing it to the libraries listed below
Sorting:
- BERT distillation(基于BERT的蒸馏实验 )☆314Jul 30, 2020Updated 5 years ago
- ☆61Nov 14, 2019Updated 6 years ago
- bert蒸馏实践,包含BiLSTM蒸馏BERT和TinyBert☆13Apr 23, 2022Updated 3 years ago
- Machine Reading Comprehension Leadboard Summary☆12Jan 4, 2021Updated 5 years ago
- Knowledge Distillation For Transformer Language Models☆54Jan 3, 2024Updated 2 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- ☆279Dec 8, 2020Updated 5 years ago
- CNN、BiLSTM、Bert(3layers)对Bert(12layers)模型的蒸馏的keras实现☆29Mar 3, 2020Updated 6 years ago
- ☆12Jul 7, 2021Updated 4 years ago
- ☆14Dec 25, 2017Updated 8 years ago
- Knowledge distillation in text classification with pytorch. 知识蒸馏,中文文本分类,教师模型BERT、XLNET,学生模型biLSTM。☆229Jul 27, 2022Updated 3 years ago
- end-to-end dialog system dataset☆13Sep 15, 2019Updated 6 years ago
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆315Jun 12, 2023Updated 2 years ago
- Self-Distillation with weighted ground-truth targets; ResNet and Kernel Ridge Regression☆19Oct 12, 2021Updated 4 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆66Mar 30, 2020Updated 5 years ago
- Deep contextualized word representations for Chinese☆151Nov 21, 2019Updated 6 years ago
- Code for KDD CUP 2019 Auto-ML track☆21Jul 25, 2019Updated 6 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,696May 8, 2023Updated 2 years ago
- Context-Sensitive Misspelling Correction of Clinical Text via Conditional Independence, CHIL 2022☆17Apr 4, 2024Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- transform multi-label classification as sentence pair task, with more training data and information☆178Dec 13, 2019Updated 6 years ago
- Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3☆184Jun 4, 2020Updated 5 years ago
- 2017 NLPCC DBQA Model (NLPCC比赛)& CCIR (搜狗搜索)& Tools Related☆18Mar 5, 2019Updated 7 years ago
- Open Language Pre-trained Model Zoo☆1,005Nov 18, 2021Updated 4 years ago
- UNF(Universal NLP Framework)☆71Mar 6, 2020Updated 6 years ago
- Negative sampling for solving the unlabeled entity problem in NER. ICLR-2021 paper: Empirical Analysis of Unlabeled Entity Problem in Nam…☆134Feb 26, 2022Updated 4 years ago
- ☆25Jun 11, 2023Updated 2 years ago
- [ICML 2021 Oral] "CATE: Computation-aware Neural Architecture Encoding with Transformers" by Shen Yan, Kaiqiang Song, Fei Liu, Mi Zhang☆19Jun 23, 2021Updated 4 years ago
- 中文bigbird预训练模型☆96Jul 5, 2022Updated 3 years ago
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆193Dec 15, 2021Updated 4 years ago
- NLP中文预训练模型泛化能力挑战赛☆42Dec 11, 2020Updated 5 years ago
- 2019达观杯实体识别☆19Sep 12, 2019Updated 6 years ago
- Lite Self-Training☆30Jul 25, 2023Updated 2 years ago
- gensim-fast2vec改造、灵活使用大规模外部词向量(具备OOV查询能力)☆23Jun 3, 2019Updated 6 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE☆146Jun 5, 2019Updated 6 years ago
- The score code of FastBERT (ACL2020)☆609Oct 29, 2021Updated 4 years ago
- 中文预训练模型生成字向量学习,测试BERT,ELMO的中文效果☆100Jan 22, 2020Updated 6 years ago
- 法研杯2019 阅读理解赛道 top3☆151Nov 13, 2023Updated 2 years ago
- AllenNLP model for the Kaggle toxic comments challenge☆32Jul 13, 2018Updated 7 years ago