naist-nlp / mbrsView external linksLinks
A library for minimum Bayes risk (MBR) decoding
☆51Nov 2, 2025Updated 3 months ago
Alternatives and similar repositories for mbrs
Users that are interested in mbrs are comparing it to the libraries listed below
Sorting:
- ☆13Aug 23, 2024Updated last year
- Efficient, Extensible kNN-MT Framework☆18Sep 7, 2024Updated last year
- A repository for experiments in quality-aware decoding☆18Jun 7, 2022Updated 3 years ago
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Curriculum training☆22Jun 25, 2025Updated 7 months ago
- BLEURT implementation in PyTorch☆37Jan 19, 2023Updated 3 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 6 months ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆26Jun 3, 2025Updated 8 months ago
- ☆21Feb 13, 2023Updated 3 years ago
- explainable-machine-translation-metrics☆12Jul 15, 2022Updated 3 years ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆18Aug 8, 2024Updated last year
- A tool that locates, downloads, and extracts machine translation corpora☆162Sep 18, 2025Updated 4 months ago
- ☆34Nov 15, 2023Updated 2 years ago
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Jul 24, 2020Updated 5 years ago
- ☆27Feb 8, 2026Updated last week
- Language model with phrase induction☆14Jun 13, 2019Updated 6 years ago
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"☆13Nov 3, 2022Updated 3 years ago
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs.☆19Apr 21, 2025Updated 9 months ago
- Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020☆14Feb 1, 2022Updated 4 years ago
- ☆17Apr 28, 2022Updated 3 years ago
- Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"☆24Dec 11, 2023Updated 2 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆126Oct 13, 2025Updated 4 months ago
- SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t…☆18Feb 22, 2024Updated last year
- ☆46Jul 7, 2025Updated 7 months ago
- A library for semantic similarity search☆26Jan 31, 2025Updated last year
- Joint Source-Target Self Attention with Locality Constraints☆19May 9, 2020Updated 5 years ago
- A curated list of research papers and resources on Cultural LLM.☆53Sep 26, 2024Updated last year
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆28Feb 8, 2023Updated 3 years ago
- ☆22Oct 26, 2020Updated 5 years ago
- A toolkit dedicate for speech evaluation.☆24Sep 26, 2024Updated last year
- A repository of Japanese Phoneme-Level BERT☆22Dec 16, 2023Updated 2 years ago
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.☆53Feb 17, 2021Updated 4 years ago
- ☆26Nov 2, 2022Updated 3 years ago
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Sep 26, 2023Updated 2 years ago
- YiSi: A Semantic Machine Translation Evaluation Metric for Evaluating Languages with Different Levels of Available Resources☆26May 28, 2019Updated 6 years ago
- Decoding platform for machine translation research☆54Aug 24, 2019Updated 6 years ago
- Tools for formatting WMT hypothesis and test sets in XML☆27Apr 18, 2025Updated 9 months ago
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- XenC: open-source data selection tool for NLP☆64Mar 21, 2016Updated 9 years ago