Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation
☆16Oct 14, 2022Updated 3 years ago
Alternatives and similar repositories for mbr-nmt
Users that are interested in mbr-nmt are comparing it to the libraries listed below.
- ☆13Aug 23, 2024Updated last year
- This repository contains code for the paper "Better Estimation of the KL Divergence Between Language Models"☆18May 30, 2025Updated 9 months ago
- ☆17Aug 30, 2025Updated 6 months ago
- Word sense disambiguation test sets for NMT☆20Dec 3, 2020Updated 5 years ago
- Direct preference optimization with f-divergences.☆16Nov 3, 2024Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated last year
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22May 24, 2023Updated 2 years ago
- Larger-Context NMT☆13Aug 20, 2017Updated 8 years ago
- Scripts to preprocess training and test data and to run fast_align and giza☆107Nov 2, 2021Updated 4 years ago
- ☆21Mar 4, 2024Updated 2 years ago
- Myanmar and Thai Language Resources☆10Jul 18, 2022Updated 3 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆38Aug 29, 2025Updated 6 months ago
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- Tools for formatting WMT hypothesis and test sets in XML☆27Apr 18, 2025Updated 11 months ago
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Jun 22, 2022Updated 3 years ago
- TIFMO: Textual Inference Forward-chaining MOdule☆12Apr 25, 2014Updated 11 years ago
- Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation"☆46Jun 30, 2022Updated 3 years ago
- ☆11Aug 26, 2021Updated 4 years ago
- Training a sign language detection model☆11Aug 26, 2022Updated 3 years ago
- Efficient Memory-Augmented Transformers☆35Dec 5, 2022Updated 3 years ago
- 🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 B…☆11Jun 8, 2021Updated 4 years ago
- scripts used for SMT system submitted to WMT 2014☆12Apr 30, 2017Updated 8 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- A set of command-line tools to preprocess videos for sign language analysis☆14Aug 20, 2025Updated 7 months ago
- MAMMOTH: MAssively Multilingual Modular Open Translation @ Helsinki☆32Updated this week
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 4 years ago
- Code for the paper "Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning"☆11May 5, 2021Updated 4 years ago
- Training scripts and recipes for Sockeye Neural Machine Translation toolkit☆37Sep 8, 2019Updated 6 years ago
- a ducttape workflow for neural machine translation☆14Mar 23, 2021Updated 5 years ago
- [ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers.☆14Jun 3, 2023Updated 2 years ago
- ☆51Jan 31, 2026Updated last month
- .files☆14Oct 12, 2025Updated 5 months ago
- Training code for Sparse Autoencoders on Embedding models☆39Feb 27, 2025Updated last year
- Official repo to On the Generalization Ability of Retrieval-Enhanced Transformers☆48Jun 4, 2024Updated last year
- Graphically structured diffusion model.☆21Jun 16, 2023Updated 2 years ago
- ☆18Apr 27, 2017Updated 8 years ago
- Add noise to your text, can be used to improve synthetic training corpus for Neural Machine Translation☆41Aug 8, 2019Updated 6 years ago
- ☆30Nov 14, 2025Updated 4 months ago