layer6ai-labs / CMLMC
Code for the ICLR'22 paper "Improving Non-Autoregressive Translation Models Without Distillation"
☆18 Updated 3 years ago
Alternatives and similar repositories for CMLMC
Users who are interested in CMLMC are comparing it to the repositories listed below
- Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation ☆44 Updated last year
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation" ☆13 Updated 2 years ago
- ☆58 Updated 3 years ago
- Official Implementation for the ICML2022 paper "Directed Acyclic Transformer for Non-Autoregressive Machine Translation" ☆130 Updated 2 years ago
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. T… ☆32 Updated 3 years ago
- Instruction to data diversification ☆24 Updated 4 years ago
- ☆86 Updated 2 years ago
- This is a code repository for the ACL 2022 paper "Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Tra… ☆52 Updated 3 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉 ☆12 Updated 2 years ago
- Source codes of ACL 2022-Efficient Cluster-based k-Nearest-Neighbor Machine Translation ☆26 Updated 3 years ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models". ☆58 Updated 5 years ago
- Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021) ☆24 Updated 4 years ago
- ☆23 Updated 2 years ago
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control ☆75 Updated 2 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆37 Updated 2 years ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory ☆82 Updated 2 years ago
- ☆15 Updated 2 years ago
- ☆38 Updated last year
- Implementation of our paper "Self-training Sampling with Monolingual Data Uncertainty for Neural Machine Translation" to appear in ACL-20… ☆31 Updated 4 years ago
- Code base for "G-Transformer for Document-level Machine Translation" ☆45 Updated 2 years ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models ☆23 Updated last year
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃 ☆114 Updated 3 years ago
- Official codebase for “In-Context Learning with Many Demonstration Examples” ☆16 Updated 2 years ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022) ☆54 Updated 2 months ago
- ☆99 Updated 3 years ago
- ☆12 Updated 4 years ago
- Implementation of latent-GLAT (ACL-2022) ☆34 Updated 3 years ago
- LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets ☆36 Updated last year
- A retrieval augmented sequence modeling toolkit implemented based on Fairseq ☆29 Updated 2 years ago
- ☆38 Updated 4 years ago