Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).
☆14Apr 4, 2025Updated last year
Alternatives and similar repositories for regularized-bon
Users that are interested in regularized-bon are comparing it to the libraries listed below.
- [EMNLP 2024] Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by …☆16Nov 27, 2024Updated last year
- Introduction to heuristic search☆20Dec 9, 2023Updated 2 years ago
- ☆17Jun 14, 2023Updated 2 years ago
- ☆32Oct 2, 2025Updated 8 months ago
- Code for magnetic mirror descent.☆18Oct 5, 2023Updated 2 years ago
- [EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling☆17Nov 20, 2025Updated 6 months ago
- ☆18Jun 3, 2024Updated 2 years ago
- A deep research framework☆30Apr 21, 2026Updated last month
- ☆15Nov 20, 2025Updated 6 months ago
- The source code for "A Simple Graph Contrastive Learning Framework for Short Text Classification"☆13Aug 14, 2025Updated 9 months ago
- ☆13May 11, 2024Updated 2 years ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning☆14Aug 12, 2021Updated 4 years ago
- Docker for UTH-BERT: https://ai-health.m.u-tokyo.ac.jp/uth-bert☆14Mar 24, 2023Updated 3 years ago
- PyTorch implementation of Count-Based Exploration with Neural Density Models☆10Mar 22, 2018Updated 8 years ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆27Aug 25, 2024Updated last year
- MultiboxBot is a bot for multiboxing on WoW with up to 40 accounts using DLL injection, hooking and sockets.☆17Updated this week
- Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Dec 8, 2022Updated 3 years ago
- The entire year on a single page☆12Dec 5, 2025Updated 6 months ago
- ☆10Feb 12, 2026Updated 3 months ago
- ☆13Jul 2, 2025Updated 11 months ago
- Thesis project about Visual Anomaly Detection based on Self Supervised Learning. The model identifies anomalies from information acquired…☆10Apr 14, 2023Updated 3 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆63Aug 30, 2024Updated last year
- A LaTeX template for typesetting short course essays.☆10Oct 21, 2019Updated 6 years ago
- ☆44Sep 19, 2024Updated last year
- ☆20Jan 26, 2026Updated 4 months ago
- When Reasoning Meets Its Laws☆37Jan 2, 2026Updated 5 months ago
- ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory☆65Apr 21, 2026Updated last month
- The official Python SDK for FastLabel API, the Data Platform for AI☆16Jun 1, 2026Updated last week
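- Publicly released Chinese short-text datasets for short-text classification research, covering sentiment classification, multi-class classification, and more☆19Aug 16, 2024Updated last year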
- ☆21Sep 24, 2020Updated 5 years ago
- A purer tokenizer with a higher compression rate, written in Rust☆13Dec 21, 2024Updated last year
- Source code for the website "サンプルで学ぶ Go 言語" ("Learning Go by Example")☆17Aug 17, 2024Updated last year
- ☆14Nov 15, 2022Updated 3 years ago
- A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.☆13Jan 13, 2026Updated 4 months ago
- Python Vector Search tutorial generated using gpt4☆12Mar 18, 2023Updated 3 years ago
- [NeurIPS 2024] Low rank memory efficient optimizer without SVD☆33Jul 1, 2025Updated 11 months ago
- ☆18Mar 3, 2025Updated last year
- Code for Research Project TLDR☆25Jul 28, 2025Updated 10 months ago