kevinmtian / distill-bertView external linksLinks
Knowledge Distillation from BERT
☆54Jan 7, 2019Updated 7 years ago
Alternatives and similar repositories for distill-bert
Users that are interested in distill-bert are comparing it to the libraries listed below
Sorting:
- BERT distillation(基于BERT的蒸馏实验 )☆314Jul 30, 2020Updated 5 years ago
- An Unofficial Implementation for Distilling Task-Specific Knowledge from BERT into Simple Neural Networks☆10Jul 25, 2024Updated last year
- My toy model for natural language inference task.☆11Aug 6, 2018Updated 7 years ago
- Machine Reading Comprehension Leadboard Summary☆12Jan 4, 2021Updated 5 years ago
- Knowledge Distillation For Transformer Language Models☆54Jan 3, 2024Updated 2 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- ☆279Dec 8, 2020Updated 5 years ago
- CNN、BiLSTM、Bert(3layers)对Bert(12layers)模型的蒸馏的keras实现☆29Mar 3, 2020Updated 5 years ago
- ☆12Jul 7, 2021Updated 4 years ago
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression☆203Sep 20, 2019Updated 6 years ago
- ☆14Dec 25, 2017Updated 8 years ago
- Knowledge distillation in text classification with pytorch. 知识蒸馏,中文文本分类,教师模型BERT、XLNET,学生模型biLSTM。☆229Jul 27, 2022Updated 3 years ago
- end-to-end dialog system dataset☆13Sep 15, 2019Updated 6 years ago
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆315Jun 12, 2023Updated 2 years ago
- TensorFlow code and pre-trained models for BERT☆12Mar 19, 2019Updated 6 years ago
- 复现论文《Distilling Task-Specific Knowledge from BERT into Simple Neural Networks》☆16Jun 13, 2021Updated 4 years ago
- Self-Distillation with weighted ground-truth targets; ResNet and Kernel Ridge Regression☆19Oct 12, 2021Updated 4 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆66Mar 30, 2020Updated 5 years ago
- Deep contextualized word representations for Chinese☆151Nov 21, 2019Updated 6 years ago
- Code for KDD CUP 2019 Auto-ML track☆21Jul 25, 2019Updated 6 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,698May 8, 2023Updated 2 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- (NeurIPS 2022) Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination☆19Nov 16, 2023Updated 2 years ago
- transform multi-label classification as sentence pair task, with more training data and information☆178Dec 13, 2019Updated 6 years ago
- Pytorch implementation of the methods proposed in **Adversarial Training Methods for Semi-Supervised Text Classification** on IMDB datase…☆44Mar 31, 2019Updated 6 years ago
- soft_mask_bert model for Chinese Spelling Correction in keras☆25Nov 3, 2020Updated 5 years ago
- Open Language Pre-trained Model Zoo☆1,004Nov 18, 2021Updated 4 years ago
- UNF(Universal NLP Framework)☆71Mar 6, 2020Updated 5 years ago
- Negative sampling for solving the unlabeled entity problem in NER. ICLR-2021 paper: Empirical Analysis of Unlabeled Entity Problem in Nam…☆134Feb 26, 2022Updated 3 years ago
- ☆25Jun 11, 2023Updated 2 years ago
- 中文bigbird预训练模型☆96Jul 5, 2022Updated 3 years ago
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆193Dec 15, 2021Updated 4 years ago
- Discovering New Intents via Constrained Deep Adaptive Clustering with Cluster Refinement (AAAI2020)☆46Dec 8, 2022Updated 3 years ago
- NLP中文预训练模型泛化能力挑战赛☆42Dec 11, 2020Updated 5 years ago
- 2019达观杯实体识别☆19Sep 12, 2019Updated 6 years ago
- ☆71Jun 28, 2022Updated 3 years ago
- gensim-fast2vec改造、灵活使用大规模外部词向量(具备OOV查询能力)☆23Jun 3, 2019Updated 6 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE☆146Jun 5, 2019Updated 6 years ago
- The score code of FastBERT (ACL2020)☆609Oct 29, 2021Updated 4 years ago