Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).
☆14Apr 4, 2025Updated 10 months ago
Alternatives and similar repositories for regularized-bon
Users who are interested in regularized-bon are comparing it to the libraries listed below.
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Nov 27, 2024Updated last year
- ☆17Jun 14, 2023Updated 2 years ago
- ☆18Jun 3, 2024Updated last year
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆26Aug 25, 2024Updated last year
- Deploying a custom pytorch model to AWS Sagemaker using terraform and FastAPI☆10Nov 10, 2023Updated 2 years ago
- An IOT based mobile application to monitor the vitals such as ECG, Body Temperature, Blood Pressure using an ESP32 DevKit and React Nativ…☆11Nov 14, 2024Updated last year
- ☆47Nov 8, 2024Updated last year
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- ☆11Updated this week
- A purer, higher-compression-ratio tokenizer in Rust☆13Dec 21, 2024Updated last year
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Jul 19, 2025Updated 7 months ago
- Public Codebase supporting the paper "Modeling Cellular Perturbations with The Sparse Additive Mechanism Shift Variational Autoencoder" b…☆14Oct 20, 2023Updated 2 years ago
- An official PyTorch implementation of "Certifiably Robust Graph Contrastive Learning" (NeurIPS 2023)☆11Jan 22, 2024Updated 2 years ago
- Hackerground Hackathon 2024☆12Aug 26, 2024Updated last year
- Official pytorch implementation of "Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use"☆20Sep 16, 2025Updated 5 months ago
- LIDA: Lightweight Interactive Dialogue Annotator (in EMNLP 2019)☆10Oct 18, 2021Updated 4 years ago
- Code for "What really matters in matrix-whitening optimizers?"☆21Oct 31, 2025Updated 4 months ago
- Review of dental related datasets for machine learning☆12Updated this week
- A LaTeX template for typesetting short course papers.☆10Oct 21, 2019Updated 6 years ago
- Official repository for "EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scena…☆19May 28, 2025Updated 9 months ago
- ☆19Jul 31, 2025Updated 7 months ago
- ☆12Nov 15, 2022Updated 3 years ago
- ☆11Oct 12, 2023Updated 2 years ago
- ☆12Feb 11, 2026Updated 2 weeks ago
- The repository of "Addressing Shortcomings in Fair Graph Learning Datasets: Towards a New Benchmark" (KDD'24)☆13Jan 27, 2026Updated last month
- Python Vector Search tutorial generated using gpt4☆12Mar 18, 2023Updated 2 years ago
- Vue, IM☆13Jun 8, 2018Updated 7 years ago
- TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment☆10Mar 1, 2025Updated last year
- Official code of our work, Representation Learning for Resource-Constrained Keyphrase Generation.☆11May 26, 2022Updated 3 years ago
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆63Dec 5, 2024Updated last year
- Stable Diffusion web UI☆10Mar 17, 2024Updated last year
- ☆14Aug 19, 2024Updated last year
- Short RL☆18May 26, 2025Updated 9 months ago
- IJCAI 2024 Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action Recognition☆14Nov 25, 2024Updated last year
- ☆17Mar 3, 2025Updated 11 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 4 months ago
- A framework for majority vote classifiers allowing for computation of PAC Bayesian risk bounds.☆14Feb 9, 2023Updated 3 years ago
- ☆12Jul 17, 2023Updated 2 years ago
- ☆14Jun 13, 2024Updated last year