kevinmtian / distill-bert
Knowledge Distillation from BERT
☆51Updated 6 years ago
Alternatives and similar repositories for distill-bert:
Users that are interested in distill-bert are comparing it to the libraries listed below
- ☆59Updated 5 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆65Updated 4 years ago
- tensorflow version of bert-of-theseus☆62Updated 4 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆77Updated last year
- First place solution of WSDM CUP 2020, pairwise-bert, lightgbm☆89Updated 5 years ago
- CLUE baseline pytorch CLUE的pytorch版本基线☆74Updated 4 years ago
- 24*2个预训练的小型BERT模型,NLPer炼丹利器☆51Updated 4 years ago
- The code for "A Unified MRC Framework for Named Entity Recognition"☆33Updated 5 years ago
- The enhanced RCNN model used for sentence similarity classification☆43Updated 3 years ago
- This is the repository for NLPCC2020 task AutoIE☆51Updated 4 years ago
- CNN、BiLSTM、Bert(3layers)对Bert(12layers)模型的蒸馏的keras实现☆27Updated 4 years ago
- Adversarial Training for NLP in Keras☆46Updated 4 years ago
- ☆47Updated 4 years ago
- ☆78Updated 5 years ago
- A Neural Multi-digraph Model for Chinese NER with Gazetteers☆86Updated 6 months ago
- A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"☆56Updated 4 years ago
- ☆23Updated 5 years ago
- ☆29Updated 5 years ago
- pytorch版bert权重转tf☆21Updated 4 years ago
- Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging☆35Updated 5 years ago
- ☆89Updated 4 years ago
- 面向中文领域的轻量文本匹配框架,集成文本匹配,文本蕴含,释义识别等领域的各个经典,STA模型☆25Updated 5 years ago
- Adversarial Attack文本匹配比赛☆42Updated 5 years ago
- lattice lstm cell implementation with tensorflow☆30Updated 6 years ago
- 2019 语言与智能技术竞赛-知识驱动对话 B榜第5名源码和模型☆27Updated 5 years ago
- The very easy BERT pretrain process by using tokenizers and transformers repos☆31Updated 4 years ago
- Baseline for the CNLI corpus☆56Updated 5 years ago
- use google pre-training model bert to fine-tuning for the chinese multiclass classification☆40Updated 6 years ago
- UNF(Universal NLP Framework)☆70Updated 4 years ago
- Hierarchical Neural Relation Extraction☆96Updated 4 years ago