JetRunner / MetaDistil
Code for ACL 2022 paper "BERT Learns to Teach: Knowledge Distillation with Meta Learning".
☆80 Updated 2 years ago
Related projects:
- Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》 ☆56 Updated 2 years ago
- Code for the AAAI 2022 publication "Well-classified Examples are Underestimated in Classification with Deep Neural Networks" ☆40 Updated 2 years ago
- The code for lifelong few-shot language learning ☆53 Updated 2 years ago
- FlatNCE: A Novel Contrastive Representation Learning Objective ☆83 Updated 2 years ago
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding” ☆29 Updated last year
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models" ☆40 Updated 2 years ago
- Codes for the paper: "Continual Learning for Text Classification with Information Disentanglement Based Regularization" ☆42 Updated last year
- Implementation of the research paper Consistent Representation Learning for Continual Relation Extraction (Findings of ACL 2022) ☆25 Updated 2 years ago
- This is the official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022) ☆96 Updated last year
- [NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li… ☆21 Updated 8 months ago
- my commonly-used tools ☆46 Updated last month
- Code for ACL 2021 paper "Unsupervised Out-of-Domain Detection via Pre-trained Transformers" ☆30 Updated 3 years ago
- Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding. ☆26 Updated 3 years ago
- Advances of few-shot learning, especially for NLP applications. ☆29 Updated last year
- Implementation of the paper Parameter-Efficient Transfer Learning for NLP, Houlsby [Google], 2019. Published in ICML 2019. ☆34 Updated last year
- Source code for paper "Contrastive Out-of-Distribution Detection for Pretrained Transformers", EMNLP 2021 ☆40 Updated 2 years ago
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models" ☆28 Updated last year
- Implementation for Variational Information Bottleneck for Effective Low-resource Fine-tuning, ICLR 2021 ☆36 Updated 3 years ago
- [EMNLP 2022] Differentiable Data Augmentation for Contrastive Sentence Representation Learning. https://arxiv.org/abs/2210.16536 ☆34 Updated last year
- Unofficial Pytorch implementation of MiniLM and MiniLMv2 ☆19 Updated 2 years ago
- ICLR 2022 ☆17 Updated 2 years ago
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022) ☆40 Updated 2 years ago