pvgladkov / knowledge-distillation
PyTorch implementations of algorithms for knowledge distillation.
☆56Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for knowledge-distillation
- Knowledge Distillation from BERT☆51Updated 5 years ago
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).☆184Updated last year
- ☆42Updated 4 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆133Updated last year
- Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework.☆69Updated 3 years ago
- The pytorch implementation of Get To The Point: Summarization with Pointer-Generator Networks.☆46Updated last year
- A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"☆56Updated 4 years ago
- Dual Co-Matching Network for Machine Reading Comprehension☆38Updated 4 years ago
- ☆50Updated last year
- CIKM 2020: Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots☆75Updated 4 years ago
- Source code for our "TitleStylist" paper at ACL 2020☆76Updated 4 months ago
- Pytorch implementation of the methods proposed in **Adversarial Training Methods for Semi-Supervised Text Classification** on IMDB datase…☆41Updated 5 years ago
- SpanNER: Named EntityRe-/Recognition as Span Prediction☆124Updated 2 years ago
- Few-shot binary text classification with Induction Networks and Word2Vec weights initialization☆108Updated 4 months ago
- ☆81Updated 4 years ago
- Library of various Few-Shot Learning frameworks for text classification☆60Updated last year
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".☆130Updated 3 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆76Updated last year
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression☆199Updated 5 years ago
- Joint Slot Filling and Intent Detection via Capsule Neural Networks (ACL'19) https://arxiv.org/abs/1812.09471☆139Updated last year
- ☆29Updated 4 years ago
- ☆67Updated 2 years ago
- Transfer RoBERTa checkpoint for Tensorflow☆16Updated 5 years ago
- Code for ACL 2019 paper: "Searching for Effective Neural Extractive Summarization: What Works and What's Next"☆91Updated 3 years ago
- A Multi-Type Multi-Span Network for Reading Comprehension that Requires Discrete Reasoning☆89Updated 5 years ago
- A PyTorch implementation of Google AI's BERT model provided with Google's pre-trained models, examples and utilities.☆35Updated 5 years ago
- Code for the paper "Efficient Adaption of Pretrained Transformers for Abstractive Summarization"☆71Updated 5 years ago
- The official implementation of EMNLP 2020, "A Simple and Effective Model for Answering Multi-span Questions".☆157Updated last year
- TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)☆173Updated 2 years ago
- Code for the paper: GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling☆68Updated 5 years ago