A library for minimum Bayes risk (MBR) decoding
☆52Nov 2, 2025Updated 7 months ago
Alternatives and similar repositories for mbrs
Users that are interested in mbrs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated 2 years ago
- ☆13Aug 23, 2024Updated last year
- A repository for experiments in quality-aware decoding☆18Jun 7, 2022Updated 4 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆15Jul 30, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Curriculum training☆22Jun 25, 2025Updated 11 months ago
- ☆21Feb 13, 2023Updated 3 years ago
- ☆15Nov 20, 2025Updated 6 months ago
- A tool that locates, downloads, and extracts machine translation corpora☆165Apr 13, 2026Updated last month
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆26Jun 3, 2025Updated last year
- Language model with phrase induction☆14Jun 13, 2019Updated 6 years ago
- Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"☆24Dec 11, 2023Updated 2 years ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore☆12May 27, 2023Updated 3 years ago
- A soft and fast pattern matcher for billion-scale corpora.☆75Feb 26, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Mar 11, 2024Updated 2 years ago
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"☆13Nov 3, 2022Updated 3 years ago
- A library for semantic similarity search☆26Jan 31, 2025Updated last year
- explainable-machine-translation-metrics☆12Jul 15, 2022Updated 3 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆132Apr 23, 2026Updated last month
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆30Feb 8, 2023Updated 3 years ago
- Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020☆13Feb 1, 2022Updated 4 years ago
- A curated list of research papers and resources on Cultural LLM.☆53Sep 26, 2024Updated last year
- ☆17Apr 28, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆13Mar 25, 2022Updated 4 years ago
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus☆13Feb 17, 2019Updated 7 years ago
- Language models are open knowledge graphs ( non official implementation )☆13Jan 17, 2021Updated 5 years ago
- [EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆397Nov 7, 2023Updated 2 years ago
- SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t…☆18Feb 22, 2024Updated 2 years ago
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs.☆18Apr 21, 2025Updated last year
- Repository containing the open source code of works published at the FBK MT unit.☆60Mar 19, 2026Updated 2 months ago
- ☆33Nov 22, 2021Updated 4 years ago
- ☆33Jul 31, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- it's a train acoustics model code lib☆27May 20, 2020Updated 6 years ago
- XenC: open-source data selection tool for NLP☆65Mar 21, 2016Updated 10 years ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- [EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling☆17Nov 20, 2025Updated 6 months ago
- Tools for formatting WMT hypothesis and test sets in XML☆27Apr 18, 2025Updated last year
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.☆53Feb 17, 2021Updated 5 years ago
- Discovery of Rhyme Schemes in Poetry☆17Nov 22, 2011Updated 14 years ago