KMnP / can
🤔 When in Doubt: Improving Classification Performance with Alternating Normalization [Findings of EMNLP2021]
☆14Updated 4 years ago
Alternatives and similar repositories for can
Users who are interested in can are comparing it to the libraries listed below
- ☆88Updated 5 years ago
- ☆50Updated 2 years ago
- ☆33Updated 4 years ago
- Code for AAAI2021 paper: Few-Shot Learning for Multi-label Intent Detection.☆109Updated 3 years ago
- Source code for our EMNLP'21 paper "Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning"☆62Updated 4 years ago
- A Transformer-based single-model, multi-scale VAE☆58Updated 4 years ago
- ☆67Updated last year
- ☆73Updated 3 years ago
- Transformer model based on the Gated Attention Unit (preview version)☆98Updated 2 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆94Updated 3 years ago
- FlatNCE: A Novel Contrastive Representation Learning Objective☆90Updated 4 years ago
- Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding.☆25Updated 5 years ago
- MASKER: Masked Keyword Regularization for Reliable Text Classification (AAAI 2021)☆54Updated 2 years ago
- WuDaoMM: a data project☆74Updated 3 years ago
- ☆65Updated 2 years ago
- A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"☆56Updated 5 years ago
- FLASHQuad_pytorch☆68Updated 3 years ago
- PyTorch implementation of the unsupervised SimCSE semantic similarity model☆23Updated 4 years ago
- A simple script for detecting and removing cryptomining malware☆19Updated 3 years ago
- Official implementation of AAAI-21 paper "Label Confusion Learning to Enhance Text Classification Models"☆119Updated 3 years ago
- Must-read papers on improving efficiency for pre-trained language models.☆105Updated 3 years ago
- some strategies for exposure bias in seq2seq☆18Updated 5 years ago
- Code for ACL 2022 paper "BERT Learns to Teach: Knowledge Distillation with Meta Learning".☆87Updated 3 years ago
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression☆203Updated 6 years ago
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆198Updated 2 years ago
- Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. Worth-reading attention…☆130Updated 4 years ago
- Shuffling files of several hundred GB in Python☆33Updated 4 years ago
- ☆54Updated 3 years ago
- Paper notes for Information Extraction, including Relation Extraction (RE), Named Entity Recognition (NER), Entity Linking (EL), Event Ex…☆17Updated 4 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆48Updated 3 years ago