A library for minimum Bayes risk (MBR) decoding
☆52 · Nov 2, 2025 · Updated 5 months ago
Alternatives and similar repositories for mbrs
Users that are interested in mbrs are comparing it to the libraries listed below.
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆60 · Jun 3, 2024 · Updated last year
- ☆13 · Aug 23, 2024 · Updated last year
- A parallel evaluation data set of SAP software documentation with document structure annotation ☆14 · Jul 30, 2025 · Updated 8 months ago
- Curriculum training ☆22 · Jun 25, 2025 · Updated 9 months ago
- A tool that locates, downloads, and extracts machine translation corpora ☆163 · Updated this week
- Unsupervised factor-based text tokenizer for natural-language processing applications ☆17 · Jul 24, 2020 · Updated 5 years ago
- ☆37 · Mar 16, 2026 · Updated last month
- Language model with phrase induction ☆14 · Jun 13, 2019 · Updated 6 years ago
- Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation" ☆24 · Dec 11, 2023 · Updated 2 years ago
- Joint Source-Target Self Attention with Locality Constraints ☆20 · May 9, 2020 · Updated 5 years ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore ☆12 · May 27, 2023 · Updated 2 years ago
- A soft and fast pattern matcher for billion-scale corpora. ☆75 · Feb 26, 2025 · Updated last year
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers" ☆19 · Mar 11, 2024 · Updated 2 years ago
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation" ☆13 · Nov 3, 2022 · Updated 3 years ago
- A library for semantic similarity search ☆26 · Jan 31, 2025 · Updated last year
- explainable-machine-translation-metrics ☆12 · Jul 15, 2022 · Updated 3 years ago
- ☆29 · Nov 14, 2025 · Updated 5 months ago
- Implementation of N-Grammer in Flax ☆17 · Nov 3, 2022 · Updated 3 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks. ☆128 · Oct 13, 2025 · Updated 6 months ago
- Multi-lingual AudioCaps ☆12 · Nov 20, 2023 · Updated 2 years ago
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.… ☆12 · Feb 25, 2025 · Updated last year
- ☆29 · Updated this week
- A curated list of research papers and resources on Cultural LLM. ☆52 · Sep 26, 2024 · Updated last year
- Submissions, baselines and evaluation scripts for the 2nd version of the WebNLG+ Challenge 2020 ☆13 · Feb 1, 2022 · Updated 4 years ago
- A Fairseq fork for sequence tagging/labeling tasks ☆32 · Jun 7, 2020 · Updated 5 years ago
- ☆16 · Apr 28, 2022 · Updated 3 years ago
- ☆10 · Sep 18, 2021 · Updated 4 years ago
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus ☆13 · Feb 17, 2019 · Updated 7 years ago
- Language models are open knowledge graphs (non-official implementation) ☆13 · Jan 17, 2021 · Updated 5 years ago
- [EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆393 · Nov 7, 2023 · Updated 2 years ago
- SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t… ☆18 · Feb 22, 2024 · Updated 2 years ago
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs. ☆18 · Apr 21, 2025 · Updated 11 months ago
- Repository containing the open source code of works published at the FBK MT unit. ☆60 · Mar 19, 2026 · Updated 3 weeks ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages ☆51 · Jul 20, 2022 · Updated 3 years ago
- ☆34 · Nov 22, 2021 · Updated 4 years ago
- ☆33 · Jul 31, 2024 · Updated last year
- XenC: open-source data selection tool for NLP ☆65 · Mar 21, 2016 · Updated 10 years ago
- Tools for formatting WMT hypothesis and test sets in XML ☆27 · Apr 18, 2025 · Updated last year
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm. ☆53 · Feb 17, 2021 · Updated 5 years ago