KMnP / can
🤔 When in Doubt: Improving Classification Performance with Alternating Normalization [Findings of EMNLP2021]
☆14 Updated 3 years ago
Alternatives and similar repositories for can:
Users who are interested in can are comparing it to the libraries listed below:
- ☆18 Updated 8 months ago
- A Transformer-based, single-model, multi-scale VAE model ☆55 Updated 3 years ago
- some strategies for exposure bias in seq2seq ☆18 Updated 4 years ago
- Official Code for 'EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification' - NAACL 2022 ☆23 Updated 2 years ago
- ☆50 Updated last year
- Shuffle files of several hundred GB in Python ☆33 Updated 3 years ago
- 😎 A simple and easy-to-use toolkit for GPU scheduling. ☆43 Updated 3 years ago
- Code for the ACL2020 paper Character-Level Translation with Self-Attention ☆32 Updated 4 years ago
- For the paper "Gaussian Transformer: A Lightweight Approach for Natural Language Inference" ☆28 Updated 5 years ago
- A simple script for detecting and removing cryptomining malware ☆16 Updated 3 years ago
- Source code for our EMNLP'21 paper "Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning" ☆59 Updated 3 years ago
- A visualizer to display attention weights on text ☆23 Updated 6 years ago
- K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (Findings of EMNLP … ☆31 Updated 2 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever" ☆32 Updated 4 years ago
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models" ☆40 Updated 2 years ago
- How Does Selective Mechanism Improve Self-attention Networks? ☆27 Updated 4 years ago
- [NLPCC 2021] Shared Task on AutoIE2: Sub-Event Identification ☆14 Updated 3 years ago
- ☆86 Updated 4 years ago
- PyTorch implementation of the paper "Hyperbolic Interaction Model For Hierarchical Multi-Label Classification" ☆48 Updated 5 years ago
- bert-of-theseus via bert4keras ☆31 Updated 4 years ago
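- Chinese version of reformer-pytorch, a simple and efficient generative model with GPT2-like results ☆16 Updated last year
- Code for the paper "Scheduled Sampling for Transformers" ☆25 Updated 5 years ago
- PyTorch version of the unsupervised SimCSE semantic similarity model ☆22 Updated 3 years ago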
- Code & Data for our Paper "PATTERN-BASED CHINESE HYPERNYM-HYPONYM RELATION EXTRACTION METHOD" ☆12 Updated 5 years ago
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation" ☆20 Updated 3 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference" ☆47 Updated 2 years ago
- Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020) ☆29 Updated 4 years ago
- DisCo Transformer for Non-autoregressive MT ☆77 Updated 2 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning ☆93 Updated 2 years ago
- FlatNCE: A Novel Contrastive Representation Learning Objective ☆90 Updated 3 years ago