pvgladkov / knowledge-distillation
PyTorch implementations of algorithms for knowledge distillation.
☆57 · Updated 5 years ago
Alternatives and similar repositories for knowledge-distillation
Users who are interested in knowledge-distillation are comparing it to the repositories listed below.
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper ☆109 · Updated 5 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper ☆136 · Updated 2 years ago
- ☆42 · Updated 5 years ago
- Code for paper "Hierarchical Text Classification with Reinforced Label Assignment" EMNLP 2019 ☆137 · Updated 4 years ago
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression ☆203 · Updated 6 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020) ☆121 · Updated 4 years ago
- Few-shot binary text classification with Induction Networks and Word2Vec weights initialization ☆109 · Updated last year
- The repo contains the code of the ACL2020 paper `Dice Loss for Data-imbalanced NLP Tasks` ☆274 · Updated 2 years ago
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987). ☆185 · Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆91 · Updated 4 years ago
- IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models ☆182 · Updated 4 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE) ☆152 · Updated 5 years ago
- Named Entity Recognition as Dependency Parsing ☆39 · Updated 5 years ago
- Enhancing the BERT training with Semi-supervised Generative Adversarial Networks ☆229 · Updated 2 years ago
- X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text Classification ☆141 · Updated 4 years ago
- A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation" ☆56 · Updated 5 years ago
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation". ☆129 · Updated 4 years ago
- Semantics-aware BERT for Language Understanding (AAAI 2020) ☆287 · Updated 3 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering ☆121 · Updated 2 years ago
- TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020) ☆172 · Updated 3 years ago
- SpanNER: Named Entity Re-/Recognition as Span Prediction ☆131 · Updated 3 years ago
- ☆121 · Updated 3 years ago
- Joint Slot Filling and Intent Detection via Capsule Neural Networks (ACL'19) https://arxiv.org/abs/1812.09471 ☆139 · Updated 2 years ago
- Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data ☆100 · Updated 2 years ago
- Hierarchical multi-label text classification of the BlurbGenreCollection using capsule networks. ☆86 · Updated 2 years ago
- MTM ☆143 · Updated 3 years ago
- Code for ACL2020 paper: Few-shot Slot Tagging with Collapsed Dependency Transfer and Label-enhanced Task-adaptive Projection Network ☆154 · Updated 3 years ago
- Implementation of Nested Named Entity Recognition using BERT ☆136 · Updated 4 years ago
- Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework. ☆72 · Updated 4 years ago
- Knowledge Distillation from BERT ☆54 · Updated 7 years ago