Knowledge Distillation from BERT
☆54Jan 7, 2019Updated 7 years ago
Alternatives and similar repositories for distill-bert
Users that are interested in distill-bert are comparing it to the libraries listed below.
- BERT distillation (distillation experiments based on BERT)☆317Jul 30, 2020Updated 5 years ago
- BERT distillation in practice, including distilling BERT into a BiLSTM, and TinyBert☆13Apr 23, 2022Updated 4 years ago
- ☆61Nov 14, 2019Updated 6 years ago
- My toy model for natural language inference task.☆11Aug 6, 2018Updated 7 years ago
- UNF(Universal NLP Framework)☆71Mar 6, 2020Updated 6 years ago
- Knowledge distillation in text classification with PyTorch: Chinese text classification, with BERT and XLNET as teacher models and a biLSTM as the student model.☆232Jul 27, 2022Updated 3 years ago
- ☆277Dec 8, 2020Updated 5 years ago
- Knowledge Distillation For Transformer Language Models☆54Jan 3, 2024Updated 2 years ago
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression☆203Sep 20, 2019Updated 6 years ago
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆316Jun 12, 2023Updated 3 years ago
- Machine Reading Comprehension Leaderboard Summary☆12Jan 4, 2021Updated 5 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- Keras implementation of distilling Bert (12 layers) into CNN, BiLSTM, and Bert (3 layers) models☆29Mar 3, 2020Updated 6 years ago
- knowledge distillation on BERT☆29Apr 11, 2020Updated 6 years ago
- Deep Unknown Intent Detection with Margin Loss (ACL2019)☆35Dec 8, 2022Updated 3 years ago
- A roberta wwm base distilled model, distilled from roberta wwm by roberta wwm large☆67Mar 30, 2020Updated 6 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- Reproduction of the paper "Distilling Task-Specific Knowledge from BERT into Simple Neural Networks"☆16Jun 13, 2021Updated 5 years ago
- Deep contextualized word representations for Chinese☆152Nov 21, 2019Updated 6 years ago
- siamese dssm sentence_similarity sentece_similarity_rank tensorflow☆60Dec 6, 2018Updated 7 years ago
- Reproduction of the ACL2020 FastBERT paper; paper address: //arxiv.org/pdf/2004.02178.pdf☆192Dec 15, 2021Updated 4 years ago
- The source code of FastBERT (ACL2020)☆607Oct 29, 2021Updated 4 years ago
- Open Language Pre-trained Model Zoo☆1,005Nov 18, 2021Updated 4 years ago
- Chinese bigbird pre-trained model☆96Jul 5, 2022Updated 3 years ago
- Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3☆185Jun 4, 2020Updated 6 years ago
- (NeurIPS 2022) Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination☆19Nov 16, 2023Updated 2 years ago
- tf&torch about nlp☆11Aug 12, 2022Updated 3 years ago
- A preprocessing tool for slot filling tasks☆21Jan 5, 2023Updated 3 years ago
- Challenge on the generalization ability of Chinese NLP pre-trained models☆42Dec 11, 2020Updated 5 years ago
- Source code for the ACL'2025 paper titled "Unveiling privacy risks in llm agent memory"☆33Dec 2, 2025Updated 6 months ago
- Transform multi-label classification into a sentence pair task, with more training data and information☆178Dec 13, 2019Updated 6 years ago
- A supervised sentence-pair matching method implemented with machine learning algorithms; the classifiers used are logistic regression (LR), SVM, GBDT, and random forest (RandomForest), and the tool used is Sklearn.☆29Jun 3, 2017Updated 9 years ago
- A simple scheduler that outputs a schedule given a todo list.☆24Nov 22, 2014Updated 11 years ago
- Discovering New Intents via Constrained Deep Adaptive Clustering with Cluster Refinement (AAAI2020)☆46Dec 8, 2022Updated 3 years ago
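- 2017 NLPCC DBQA Model (NLPCC competition) & CCIR (Sogou Search) & Tools Related☆18Mar 5, 2019Updated 7 years ago
- 12th-place solution for the 3rd 魔镜杯 (Magic Mirror Cup): question similarity algorithm design for intelligent customer service☆148Feb 27, 2019Updated 7 years ago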
- An R package to help assess the sensitivity of a Bayesian model (fitted with Stan) to the specification of its likelihood and priors☆11May 29, 2026Updated 2 weeks ago
- PyTorch implementation of the ACL 2019 paper "Improving Question Answering over Incomplete KBs with Knowledge-Aware Reader"☆138Feb 4, 2020Updated 6 years ago
- Lookahead optimizer ("Lookahead Optimizer: k steps forward, 1 step back") for tensorflow☆25Sep 3, 2019Updated 6 years ago