JetRunner / MetaDistil

Code for ACL 2022 paper "BERT Learns to Teach: Knowledge Distillation with Meta Learning".

☆87

Alternatives and similar repositories for MetaDistil

Users who are interested in MetaDistil are comparing it to the repositories listed below.

RunxinXu / ChildTuning
Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》
☆61 Updated 4 years ago
qcwthu / Lifelong-Fewshot-Language-Learning
The code for lifelong few-shot language learning
☆55 Updated 3 years ago
zhouj8553 / FlipDA
☆67 Updated last year
lancopku / DynamicKD
Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"
☆41 Updated 3 years ago
rabeehk / hyperformer
☆158 Updated 4 years ago
AkariAsai / ATTEMPT
This is the official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
☆103 Updated 2 years ago
pkunlp-icler / ChildTuning
☆33 Updated 4 years ago
morningmoni / UniPELT
Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022
☆63 Updated 3 years ago
Junya-Chen / FlatCLR
FlatNCE: A Novel Contrastive Representation Learning Objective
☆89 Updated 4 years ago
SALT-NLP / IDBR
Code for the paper "Continual Learning for Text Classification with Information Disentanglement Based Regularization"
☆44 Updated 2 years ago
sh0416 / clrcmd
Official repository for CLRCMD (appeared at ACL 2022)
☆42 Updated 2 years ago
thunlp / MixADA
☆21 Updated 4 years ago
RunxinXu / ContrastivePruning
Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》
☆25 Updated 3 years ago
kugwzk / DiDE
Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”
☆31 Updated 2 years ago
eyalbd2 / PADA
Official code for the paper "PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains".
☆51 Updated 3 years ago
rabeehk / vibert
Implementation for Variational Information Bottleneck for Effective Low-resource Fine-tuning, ICLR 2021
☆41 Updated 4 years ago
thunlp / ELLE
☆32 Updated 3 years ago
zjunlp / DART
[ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners
☆129 Updated 2 years ago
OhadRubin / EPR
☆64 Updated 2 years ago
fuzihaofzh / AnalyzeParameterEfficientFinetune
On the Effectiveness of Parameter-Efficient Fine-Tuning
☆38 Updated 2 years ago
thunlp / TR-BERT
Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"
☆48 Updated 3 years ago
lancopku / well-classified-examples-are-underestimated
Code for the AAAI 2022 publication "Well-classified Examples are Underestimated in Classification with Deep Neural Networks"
☆54 Updated 3 years ago
GeneZC / MiniMoE
Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"
☆29 Updated 2 years ago
wzhouad / Contra-OOD
Source code for paper "Contrastive Out-of-Distribution Detection for Pretrained Transformers", EMNLP 2021
☆40 Updated 3 years ago
TobiasLee / Awesome-Efficient-PLM
Must-read papers on improving efficiency for pre-trained language models.
☆105 Updated 3 years ago
llyx97 / sparse-and-robust-PLM
[NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li…
☆21 Updated last year
thuiar / CRL
Implementation of the research paper Consistent Representation Learning for Continual Relation Extraction (Findings of ACL 2022)
☆26 Updated 3 years ago
wutong8023 / PLM4CL
ICLR 2022
☆18 Updated 3 years ago
rivercold / BERT-unsupervised-OOD
Code for ACL 2021 paper "Unsupervised Out-of-Domain Detection via Pre-trained Transformers"
☆30 Updated 4 years ago
yiren-jian / NonLing-CSE
[NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings
☆22 Updated 2 years ago