KMnP / can
🤔 When in Doubt: Improving Classification Performance with Alternating Normalization [Findings of EMNLP 2021]
☆14Updated 3 years ago
Alternatives and similar repositories for can:
Users who are interested in can (KMnP/can) are comparing it to the repositories listed below
- A Transformer-based single-model, multi-scale VAE model☆55Updated 3 years ago
- some strategies for exposure bias in seq2seq☆18Updated 4 years ago
- ☆50Updated last year
- A simple script for detecting and removing cryptomining malware☆16Updated 2 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆45Updated 2 years ago
- Chinese version of reformer-pytorch, a simple and efficient generative model with results similar to GPT2☆16Updated last year
- Shuffle files of several hundred GB in Python☆33Updated 3 years ago
- A visualizer to display attention weights on text☆23Updated 5 years ago
- [EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations☆31Updated 2 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆32Updated last year
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning☆69Updated 3 years ago
- ☆19Updated 2 years ago
- Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER☆9Updated 5 years ago
- Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation"☆18Updated 5 years ago
- A small framework mimics PyTorch using CuPy or NumPy☆27Updated 3 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"☆32Updated 4 years ago
- [NLPCC 2021] Shared Task on AutoIE2: Sub-Event Identification☆14Updated 3 years ago
- ☆22Updated 3 years ago
- Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"☆24Updated 2 years ago
- Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)☆29Updated 4 years ago
- PyTorch implementation of the paper "Hyperbolic Interaction Model For Hierarchical Multi-Label Classification"☆48Updated 5 years ago
- Code for EMNLP 2020 paper CoDIR☆41Updated 2 years ago
- ☆86Updated 4 years ago
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation"☆20Updated 3 years ago
- How Does Selective Mechanism Improve Self-attention Networks?☆27Updated 3 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆93Updated 2 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Updated 2 years ago
- [ACL'20] Highway Transformer: A Gated Transformer.☆32Updated 3 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Updated 2 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Updated 2 years ago