naist-nlp / mbrs
A library for minimum Bayes risk (MBR) decoding
☆51 · Nov 2, 2025 · Updated 3 months ago
Alternatives and similar repositories for mbrs
Users who are interested in mbrs are comparing it to the libraries listed below.
- ☆13 · Aug 23, 2024 · Updated last year
- Efficient, Extensible kNN-MT Framework ☆18 · Sep 7, 2024 · Updated last year
- A repository for experiments in quality-aware decoding ☆18 · Jun 7, 2022 · Updated 3 years ago
- Python package to augment multilingual data ☆15 · Feb 15, 2023 · Updated 3 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆60 · Jun 3, 2024 · Updated last year
- Curriculum training ☆22 · Jun 25, 2025 · Updated 7 months ago
- A parallel evaluation data set of SAP software documentation with document structure annotation ☆14 · Jul 30, 2025 · Updated 6 months ago
- BLEURT implementation in PyTorch ☆37 · Jan 19, 2023 · Updated 3 years ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?" ☆26 · Jun 3, 2025 · Updated 8 months ago
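- Materials and source code for the NLP2025 tutorial "地理情報と言語処理 実践入門" (Geographic Information and Language Processing: A Practical Introduction) ☆17 · Updated this week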
- ☆15 · Nov 20, 2025 · Updated 2 months ago
- ☆21 · Feb 13, 2023 · Updated 3 years ago
- explainable-machine-translation-metrics ☆12 · Jul 15, 2022 · Updated 3 years ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals ☆18 · Aug 8, 2024 · Updated last year
- A tool that locates, downloads, and extracts machine translation corpora ☆162 · Sep 18, 2025 · Updated 4 months ago
- ☆35 · Nov 15, 2023 · Updated 2 years ago
- ☆28 · Updated this week
- Unsupervised factor-based text tokenizer for natural-language processing applications ☆17 · Jul 24, 2020 · Updated 5 years ago
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs. ☆19 · Apr 21, 2025 · Updated 9 months ago
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation" ☆13 · Nov 3, 2022 · Updated 3 years ago
- Language model with phrase induction ☆14 · Jun 13, 2019 · Updated 6 years ago
- Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020 ☆14 · Feb 1, 2022 · Updated 4 years ago
- A soft and fast pattern matcher for billion-scale corpora. ☆75 · Feb 26, 2025 · Updated 11 months ago
- ☆17 · Apr 28, 2022 · Updated 3 years ago
- Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation" ☆24 · Dec 11, 2023 · Updated 2 years ago
- ☆16 · Dec 18, 2023 · Updated 2 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks. ☆126 · Oct 13, 2025 · Updated 4 months ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation ☆41 · Feb 9, 2023 · Updated 3 years ago
- Repository containing the open source code of works published at the FBK MT unit. ☆59 · Jan 16, 2026 · Updated last month
- SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t… ☆18 · Feb 22, 2024 · Updated last year
- ☆46 · Jul 7, 2025 · Updated 7 months ago
- A library for semantic similarity search ☆26 · Jan 31, 2025 · Updated last year
- A toolkit dedicated to speech evaluation. ☆24 · Sep 26, 2024 · Updated last year
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E… ☆28 · Feb 8, 2023 · Updated 3 years ago
- A repository of Japanese Phoneme-Level BERT ☆22 · Dec 16, 2023 · Updated 2 years ago
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm. ☆53 · Feb 17, 2021 · Updated 4 years ago
- ☆26 · Nov 2, 2022 · Updated 3 years ago
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models ☆25 · Sep 26, 2023 · Updated 2 years ago
- A Neural Framework for MT Evaluation ☆713 · Feb 5, 2026 · Updated last week