pvgladkov / knowledge-distillation
PyTorch implementations of algorithms for knowledge distillation.
☆57Updated 4 years ago
Alternatives and similar repositories for knowledge-distillation:
Users that are interested in knowledge-distillation are comparing it to the libraries listed below
- A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"☆56Updated 5 years ago
- ☆42Updated 4 years ago
- ☆66Updated 2 years ago
- Knowledge Distillation from BERT☆52Updated 6 years ago
- CIKM 2020: Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots☆74Updated 4 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆120Updated 4 years ago
- Code for the paper: GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling☆67Updated 5 years ago
- Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework.☆72Updated 3 years ago
- Code and Data for SIGIR 2020 Paper "Few-Shot Generative Conversational Query Rewriting"☆65Updated last year
- Selections from EMNLP 2020☆59Updated 3 years ago
- ☆28Updated 5 years ago
- AAAI-2021 paper: Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders.☆38Updated 3 years ago
- Named Entity Recognition as Dependency Parsing☆39Updated 4 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆135Updated last year
- ☆66Updated 3 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Updated 3 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆150Updated 4 years ago
- BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer☆101Updated 4 years ago
- 1. Pretrain Albert on custom corpus 2. Finetune the pretrained Albert model on downstream task☆33Updated 4 years ago
- ☆25Updated 4 years ago
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper☆107Updated 4 years ago
- Code for "A Unified Model for Joint Chinese Word Segmentation and Dependency Parsing"☆38Updated 2 years ago
- EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling☆77Updated 2 years ago
- Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling☆10Updated 2 years ago
- This is the source implementation of ACL2019 accepted paper: A Novel Bi-directional Interrelated Model for Joint Intent Detection and Slo…☆102Updated 2 years ago
- reference pytorch code for named entity tagging☆85Updated 5 months ago
- ☆47Updated 4 years ago
- ☆50Updated last year
- This repo contains a PyTorch implementation of a pretrained BERT model for sentence similarity task.☆48Updated 6 years ago
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).☆183Updated last year