pvgladkov / knowledge-distillationLinks
PyTorch implementations of algorithms for knowledge distillation.
☆57Updated 5 years ago
Alternatives and similar repositories for knowledge-distillation
Users that are interested in knowledge-distillation are comparing it to the libraries listed below
Sorting:
- Code for paper "Hierarchical Text Classification with Reinforced Label Assignment" EMNLP 2019☆139Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆132Updated 2 years ago
- ☆42Updated 4 years ago
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper☆108Updated 4 years ago
- CIKM 2020: Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots☆74Updated 4 years ago
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).☆184Updated 2 years ago
- A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"☆56Updated 5 years ago
- MTM☆142Updated 2 years ago
- Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework.☆72Updated 3 years ago
- IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models☆179Updated 4 years ago
- Source code for our "TitleStylist" paper at ACL 2020☆76Updated 11 months ago
- Knowledge Distillation from BERT☆52Updated 6 years ago
- TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)☆172Updated 3 years ago
- Selections from EMNLP 2020☆59Updated 4 years ago
- SpanNER: Named EntityRe-/Recognition as Span Prediction☆131Updated 3 years ago
- Named Entity Recognition as Dependency Parsing☆39Updated 5 years ago
- Semantics-aware BERT for Language Understanding (AAAI 2020)☆288Updated 2 years ago
- ☆25Updated 5 years ago
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".☆131Updated 4 years ago
- X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text Classification☆138Updated 4 years ago
- Fine-tune BERT to generate sentence embedding for cosine similarity☆69Updated 5 years ago
- reference pytorch code for named entity tagging☆86Updated 8 months ago
- Joint Slot Filling and Intent Detection via Capsule Neural Networks (ACL'19) https://arxiv.org/abs/1812.09471☆139Updated 2 years ago
- Few-shot binary text classification with Induction Networks and Word2Vec weights initialization☆108Updated 11 months ago
- ☆67Updated 3 years ago
- Repository for the paper "Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning"☆109Updated 4 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆152Updated 4 years ago
- BERT for joint intent classification and slot filling☆39Updated 5 years ago
- Code for the paper: GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling☆66Updated 5 years ago
- Code for the paper "Efficient Adaption of Pretrained Transformers for Abstractive Summarization"☆71Updated 6 years ago