intersun / PKD-for-BERT-Model-CompressionLinks

pytorch implementation for Patient Knowledge Distillation for BERT Model Compression

☆203

Alternatives and similar repositories for PKD-for-BERT-Model-Compression

Users that are interested in PKD-for-BERT-Model-Compression are comparing it to the libraries listed below

Sorting:

JetRunner / BERT-of-Theseus
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
☆316Updated 2 years ago
zhuchen03 / FreeLB
Adversarial Training for Natural Language Understanding
☆254Updated 2 years ago
NoviScl / BERT-RACE
☆79Updated 3 years ago
lonePatient / electra_pytorch
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆91Updated 4 years ago
lonePatient / BERT-SDA
A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"
☆56Updated 5 years ago
Sanyuan-Chen / RecAdam
Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting.
☆118Updated 5 years ago
qiangsiwei / bert_distill
BERT distillation（基于BERT的蒸馏实验）
☆315Updated 5 years ago
asappresearch / revisit-bert-finetuning
For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).
☆184Updated 2 years ago
BitVoyage / FastBERT
对ACL2020 FastBERT论文的复现，论文地址//arxiv.org/pdf/2004.02178.pdf
☆194Updated 3 years ago
kevinmtian / distill-bert
Knowledge Distillation from BERT
☆54Updated 6 years ago
yitu-opensource / ConvBert
☆254Updated 3 years ago
SanghunYun / UDA_pytorch
UDA(Unsupervised Data Augmentation) implemented by pytorch
☆278Updated 5 years ago
pmichel31415 / are-16-heads-really-better-than-1
Code for the paper "Are Sixteen Heads Really Better than One?"
☆173Updated 5 years ago
laohur / Learning-To-Compare-For-Text
Learning To Compare For Text , Few shot learning in text classification
☆42Updated 5 years ago
YujiaBao / Distributional-Signatures
"Few-shot Text Classification with Distributional Signatures" ICLR 2020
☆261Updated 4 years ago
AtmaHou / FewShotMultiLabel
Code for AAAI2021 paper: Few-Shot Learning for Multi-label Intent Detection.
☆109Updated 3 years ago
linzehui / mRASP
☆167Updated 3 years ago
eaglenlp / Text-Matching
☆25Updated 5 years ago
ZhengZixiang / ATPapers
Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力…
☆130Updated 4 years ago
AtmaHou / FewShotTagging
Code for ACL2020 paper: Few-shot Slot Tagging with Collapsed Dependency Transfer and Label-enhanced Task-adaptive Projection Network
☆154Updated 3 years ago
joongbo / tta
Repository for the paper "Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning"
☆110Updated 5 years ago
LooperXX / DF-Net
Open source code for ACL 2020 Paper "Dynamic Fusion Network for Multi-Domain End-to-end Task-Oriented Dialog"
☆104Updated 3 years ago
wuzhiye7 / Induction-Network-on-FewRel
An attempt at replicating the Induction Network for FewRel data in Tensorflow
☆178Updated 6 years ago
jcyk / BERT
a simple yet complete implementation of the popular BERT model
☆128Updated 5 years ago
lancopku / Seq2Set
Code for the paper "A Deep Reinforced Sequence-to-Set Model for Multi-Label Classification"
☆50Updated 5 years ago
lxk00 / BERT-EMD
☆50Updated 2 years ago
haoyuhu / bert-multi-gpu
Feel free to fine tune large BERT models with Multi-GPU and FP16 support.
☆192Updated 5 years ago
huminghao16 / thesis
Minghao Hu's thesis on Machine Reading Comprehension
☆37Updated 5 years ago
Gorov / DiverseFewShot_Amazon
☆121Updated 6 years ago
ha-lins / MetaLearning4NLP-Papers
A list of recent papers about Meta / few-shot learning methods applied in NLP areas.
☆231Updated 4 years ago