kevinmtian / distill-bert
Knowledge Distillation from BERT
☆53Updated 6 years ago
Alternatives and similar repositories for distill-bert
Users that are interested in distill-bert are comparing it to the libraries listed below
- tensorflow version of bert-of-theseus☆63Updated 4 years ago
- A distilled RoBERTa-wwm base model, distilled from RoBERTa-wwm by RoBERTa-wwm large☆65Updated 5 years ago
- ☆61Updated 5 years ago
- First place solution of WSDM CUP 2020, pairwise-bert, lightgbm☆89Updated 5 years ago
- 24*2 pre-trained small BERT models, a handy tool for NLP practitioners training models☆51Updated 5 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆78Updated 2 years ago
- ☆90Updated 5 years ago
- UNF(Universal NLP Framework)☆71Updated 5 years ago
- CLUE baseline pytorch: the PyTorch version of the CLUE baselines☆75Updated 5 years ago
- Reproduction of the ACL 2020 FastBERT paper; paper at //arxiv.org/pdf/2004.02178.pdf☆194Updated 3 years ago
- Chinese pre-trained ELECTRA model: pretrain Chinese Model based on adversarial learning☆141Updated 5 years ago
- DistilBERT for Chinese: a distilled BERT model pre-trained on large-scale Chinese data☆94Updated 5 years ago
- Rank2 solution (no-BERT) for 2019 Language and Intelligence Challenge - DuReader2.0 Machine Reading Comprehension.☆128Updated 5 years ago
- Adversarial Attack text matching competition☆42Updated 5 years ago
- The code for "A Unified MRC Framework for Named Entity Recognition"☆33Updated 5 years ago
- Adversarial Training for NLP in Keras☆46Updated 5 years ago
- modification of official bert for downstream task☆32Updated 2 years ago
- Transform multi-label classification into a sentence pair task, with more training data and information☆178Updated 5 years ago
- 2019 Language and Intelligence Challenge, knowledge-driven dialogue: source code and models of the 5th-place entry on leaderboard B☆27Updated 6 years ago
- Tianchi COVID-19 similar sentence pair judgment competition, Rank 8☆52Updated 5 years ago
- TensorFlow implementation of the ESIM model (Enhanced LSTM for natural language inference)☆77Updated 6 years ago
- Chinese translation of the paper "XLNet: Generalized Autoregressive Pretraining for Language Understanding"☆49Updated 5 years ago
- A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"☆56Updated 5 years ago
- ☆50Updated 7 years ago
- The enhanced RCNN model used for sentence similarity classification☆44Updated 4 years ago
- 2019 Language and Intelligence Challenge, knowledge-driven dialogue: source code and models of the 5th-place entry on leaderboard B☆25Updated 5 years ago
- A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)☆127Updated 2 years ago
- Using BERT to solve the lic2019 machine reading comprehension task☆90Updated 6 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆121Updated 2 years ago
- Dataset for CIKM 2018 paper "Multi-Source Pointer Network for Product Title Summarization"☆73Updated 6 years ago