bzantium / pytorch-PKD-for-BERT-compression
☆15Updated 6 years ago
Alternatives and similar repositories for pytorch-PKD-for-BERT-compression
Users who are interested in pytorch-PKD-for-BERT-compression are comparing it to the repositories listed below.
- ☆67Updated last year
- Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding.☆25Updated 5 years ago
- PyTorch version of the unsupervised SimCSE semantic similarity model☆23Updated 4 years ago
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression☆204Updated 6 years ago
- Must-read papers on improving efficiency for pre-trained language models.☆105Updated 3 years ago
- A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"☆56Updated 5 years ago
- ☆12Updated 7 years ago
- ☆48Updated 4 years ago
- ACL-2022 paper: Divide and Conquer: Text Semantic Matching with Disentangled Keywords and Intents.☆38Updated 3 years ago
- Code for AAAI2021 paper: Few-Shot Learning for Multi-label Intent Detection.☆109Updated 3 years ago
- Implementation of "Curriculum Learning for Natural Language Understanding" (Xu et al., 2020)☆12Updated 5 years ago
- ☆54Updated 3 years ago
- Pretrain CPM-1☆52Updated 4 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆48Updated 3 years ago
- Chinese Machine Reading: 2021 Haihua AI Challenge, Chinese reading comprehension, technical track, third place☆21Updated 4 years ago
- A paper list of pre-trained language models (PLMs).☆81Updated 4 years ago
- ☆21Updated 4 years ago
- Source code and dataset for the paper "GECOR: An End-to-End Generative Ellipsis and Co-reference Resolution Model for Task-Oriented Dialo…☆30Updated 2 years ago
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"☆29Updated 2 years ago
- ☆45Updated 4 years ago
- Code for the paper "A Deep Reinforced Sequence-to-Set Model for Multi-Label Classification"☆50Updated 5 years ago
- ☆33Updated 4 years ago
- ☆50Updated 4 years ago
- ☆80Updated 3 years ago
- Good Examples Make A Faster Learner: Simple Demonstration-based Learning for Low-resource NER (ACL 2022)☆44Updated 3 years ago
- Group Meeting Record for Baobao Chang Group in Peking University☆26Updated 4 years ago
- Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit☆28Updated 3 years ago
- Code for paper "Stylized Dialogue Response Generation Using Stylized Unpaired Texts"☆31Updated 3 years ago
- ☆75Updated 3 years ago
- Code for NAACL2022 Long Paper "An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling"☆28Updated 3 years ago