Minimum Bayes Risk Decoding for Hugging Face Transformers
☆60 · Jun 3, 2024 · Updated last year
Alternatives and similar repositories for mbr
Users interested in mbr compare it to the libraries listed below.
- Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar". ☆12 · Jan 4, 2024 · Updated 2 years ago
- ☆13 · Aug 23, 2024 · Updated last year
- A library for minimum Bayes risk (MBR) decoding ☆52 · Nov 2, 2025 · Updated 4 months ago
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation ☆16 · Oct 14, 2022 · Updated 3 years ago
- Source-to-Source Debuggable Derivatives in Pure Python ☆15 · Jan 23, 2024 · Updated 2 years ago
- ☆30 · Nov 14, 2025 · Updated 3 months ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling ☆40 · Dec 2, 2023 · Updated 2 years ago
- ☆13 · Apr 29, 2025 · Updated 10 months ago
- A framework for evaluating Machine Translation models. ☆12 · May 26, 2025 · Updated 9 months ago
- MAMMOTH: MAssively Multilingual Modular Open Translation @ Helsinki ☆30 · Updated this week
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?" ☆26 · Jun 3, 2025 · Updated 8 months ago
- ☆14 · Jun 24, 2024 · Updated last year
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span … ☆14 · Aug 25, 2023 · Updated 2 years ago
- ☆15 · Nov 20, 2025 · Updated 3 months ago
- ☆51 · Jan 28, 2024 · Updated 2 years ago
- ☆14 · Feb 1, 2024 · Updated 2 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns" ☆18 · Mar 15, 2024 · Updated last year
- ☆12 · Dec 13, 2022 · Updated 3 years ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021) ☆12 · Mar 7, 2024 · Updated last year
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)… ☆13 · Sep 8, 2022 · Updated 3 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages ☆11 · Feb 6, 2024 · Updated 2 years ago
- Evaluation results for Machine Translation within the BigScience project ☆11 · May 15, 2023 · Updated 2 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales" ☆12 · Sep 12, 2023 · Updated 2 years ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation" ☆14 · Mar 1, 2023 · Updated 3 years ago
- Experiment of using Tangent to autodiff triton ☆82 · Jan 22, 2024 · Updated 2 years ago
- Course for Interpreting ML Models ☆52 · Feb 16, 2023 · Updated 3 years ago
- Python package to augment multilingual data ☆15 · Feb 15, 2023 · Updated 3 years ago
- ☆13 · Feb 7, 2023 · Updated 3 years ago
- ☆17 · Aug 30, 2025 · Updated 6 months ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights ☆19 · Oct 9, 2022 · Updated 3 years ago
- This repository contains code for the paper "Better Estimation of the KL Divergence Between Language Models" ☆18 · May 30, 2025 · Updated 9 months ago
- ☆11 · Sep 25, 2025 · Updated 5 months ago
- Matrix tools for building and inspecting latent spaces ☆27 · Aug 19, 2018 · Updated 7 years ago
- ☆16 · Dec 9, 2023 · Updated 2 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 · Mar 20, 2024 · Updated last year
- CS224S Course Project ☆14 · Jun 9, 2014 · Updated 11 years ago
- An interactive tool for analyzing, executing, and improving dynamic programming algorithms. ☆19 · Jan 30, 2026 · Updated last month
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset. ☆20 · Feb 12, 2023 · Updated 3 years ago
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing". ☆14 · Sep 30, 2023 · Updated 2 years ago