The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding"
☆37Aug 29, 2025Updated 6 months ago
Alternatives and similar repositories for ContraDecode
Users that are interested in ContraDecode are comparing it to the libraries listed below
Sorting:
- Curriculum training☆22Jun 25, 2025Updated 8 months ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago
- Tools for formatting WMT hypothesis and test sets in XML☆27Apr 18, 2025Updated 11 months ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- ☆18Oct 6, 2022Updated 3 years ago
- Dataset and codes for the paper "Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training".☆25Mar 6, 2022Updated 4 years ago
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…☆177Dec 31, 2024Updated last year
- Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL …☆11May 22, 2023Updated 2 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago
- 🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 B…☆11Jun 8, 2021Updated 4 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆58Feb 3, 2026Updated last month
- Appraise code used as part of WMT21 human evaluation campaign☆30Dec 15, 2025Updated 3 months ago
- ☆21Feb 13, 2023Updated 3 years ago
- OpusFilter - Parallel corpus processing toolkit☆115Feb 11, 2026Updated last month
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆23Jun 28, 2024Updated last year
- ☆254May 30, 2024Updated last year
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Apr 20, 2024Updated last year
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆82Jun 12, 2023Updated 2 years ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Jul 18, 2023Updated 2 years ago
- Knowledge Graph-augmented NMT☆11Sep 20, 2021Updated 4 years ago
- Cross-lingual Visual Pre-training for Multimodal Machine Translation☆18Dec 28, 2021Updated 4 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22May 24, 2023Updated 2 years ago
- [EMNLP2022] Source code for Neural Machine Translation with Contrastive Translation Memories☆12Feb 15, 2023Updated 3 years ago
- This is the official code for our paper "Simple and Scalable Nearest Neighbor Machine Translation" (ICLR 2023).☆14Nov 22, 2023Updated 2 years ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆28Feb 8, 2023Updated 3 years ago
- ☆18Oct 5, 2017Updated 8 years ago
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"☆12Dec 8, 2024Updated last year
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Code for LLM_Catastrophic_Forgetting via SAM.☆11Jun 7, 2024Updated last year
- Zero -- A neural machine translation system☆153May 8, 2023Updated 2 years ago
- ☆15Dec 8, 2022Updated 3 years ago
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"☆13Nov 3, 2022Updated 3 years ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation"☆46Jun 30, 2022Updated 3 years ago
- SLTUNET: A Simple Unified Model for Sign Language Translation (ICLR 2023)☆36Jul 10, 2023Updated 2 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆163Sep 18, 2025Updated 6 months ago