JDEA-NLP / Vega-MT
[WMT 2022 champion system] Vega-MT model and inference scripts
β40Updated last year
Related projects β
Alternatives and complementary repositories for Vega-MT
- π[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERTβ193Updated last year
- π[ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translationβ72Updated 7 months ago
- [ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPTβ88Updated 10 months ago
- LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasetsβ34Updated last month
- This is a code repository for the ACL 2022 paper "Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Traβ¦β52Updated 2 years ago
- code for Teaching LM to Translate with Comparisonβ37Updated 11 months ago
- Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup".β21Updated 2 years ago
- Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)β24Updated 3 years ago
- β14Updated 2 years ago
- Source code for our EMNLP'21 paper γRaise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuningγβ56Updated 3 years ago
- code repo for EMNLP'21 Finding Counter-Interference Adapter for Multilingual Machine Translationβ17Updated 2 years ago
- A benchmark for the task of translation suggestionβ59Updated 2 years ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Modelsβ22Updated 3 months ago
- [ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translationβ31Updated last year
- β42Updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedbackβ38Updated last year
- Implementation of "Glancing Transformer for Non-Autoregressive Neural Machine Translation"β136Updated last year
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. Tβ¦β30Updated 2 years ago
- β163Updated 4 months ago
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"β28Updated last year
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. πβ12Updated last year
- Code base for "G-Transformer for Document-level Machine Translation"β43Updated last year
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generationβ150Updated last year
- β15Updated 3 years ago
- β59Updated last year
- β80Updated 2 years ago
- This repo contains the code for Late Prompt Tuning.β11Updated last year
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)β97Updated last year
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learningβ38Updated last year
- Code for ACL 2022 paper "BERT Learns to Teach: Knowledge Distillation with Meta Learning".β83Updated 2 years ago