bzantium / pytorch-PKD-for-BERT-compression
☆15Updated 5 years ago
Alternatives and similar repositories for pytorch-PKD-for-BERT-compression:
Users that are interested in pytorch-PKD-for-BERT-compression are comparing it to the libraries listed below
- Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding.☆25Updated 4 years ago
- pytorch版simcse无监督语义相似模型☆22Updated 3 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆47Updated 2 years ago
- ☆66Updated 11 months ago
- ☆12Updated 6 years ago
- ☆21Updated 3 years ago
- ☆44Updated 3 years ago
- ☆28Updated 4 years ago
- Example code for paper "Distilling Knowledge from Well-informed Soft Labels for Neural Relation Extraction" (AAAI 2020).☆22Updated 5 years ago
- ☆10Updated 3 years ago
- Must-read papers on improving efficiency for pre-trained language models.☆103Updated 2 years ago
- Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit☆28Updated 2 years ago
- ☆50Updated last year
- A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"☆56Updated 5 years ago
- Implementation of AAAI 21 paper: Nested Named Entity Recognition with Partially Observed TreeCRFs☆52Updated 3 years ago
- Implementation of "Curriculum Learning for Natural Language Understanding" (xu et. al. 2020)☆12Updated 4 years ago
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"☆28Updated last year
- The implementation of the ACL 2020 paper "Learning to Customize Model Structures for Few-shot Dialogue Generation Tasks"☆24Updated 9 months ago
- 🎮 A toolkit for Relation Extraction and more...☆24Updated 6 months ago
- ☆22Updated 3 years ago
- Code for CascadeBERT, Findings of EMNLP 2021☆12Updated 3 years ago
- ☆44Updated 3 years ago
- A paper list of pre-trained language models (PLMs).☆80Updated 3 years ago
- Paradigm shift in natural language processing☆42Updated 2 years ago
- Code for KE-Blender, EMNLP 2021☆19Updated 3 years ago
- Source code and dataset for the paper "GECOR: An End-to-End Generative Ellipsis and Co-reference Resolution Model for Task-Oriented Dialo…☆31Updated last year
- The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach …☆63Updated 4 years ago
- MATCH-TUNING☆15Updated 2 years ago
- ☆18Updated 3 years ago
- Code for ACL 2021 main conference paper "Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances".☆94Updated 3 years ago