Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".
☆29Oct 30, 2024Updated last year
Alternatives and similar repositories for MOD
Users that are interested in MOD are comparing it to the libraries listed below
Sorting:
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆79Jun 10, 2025Updated 8 months ago
- Rewarded soups official implementation☆62Sep 27, 2023Updated 2 years ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆96Aug 20, 2024Updated last year
- ☆10Mar 6, 2022Updated 3 years ago
- Our EMNLP 2022 paper on VIP-Based Prompting for Parameter-Efficient Learning☆10Oct 22, 2022Updated 3 years ago
- ☆12Oct 28, 2022Updated 3 years ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Apr 9, 2024Updated last year
- ☆12Jan 5, 2023Updated 3 years ago
- Code for☆16Oct 16, 2020Updated 5 years ago
- Official repo for Trumpets: Injective Flows for Inference and Inverse Problems☆13Jun 2, 2021Updated 4 years ago
- Code Release for "Self-supervised Learning is More Robust to Dataset Imbalance"☆39Feb 11, 2022Updated 4 years ago
- ☆24Jun 22, 2021Updated 4 years ago
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆28Apr 2, 2025Updated 10 months ago
- ☆28Jul 16, 2024Updated last year
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- ☆46Feb 8, 2024Updated 2 years ago
- Lightweight Adapting for Black-Box Large Language Models☆25Feb 15, 2024Updated 2 years ago
- Code to reproduce results from "Invertible generative models for inverse problems: mitigating representation error and dataset bias"☆21Jul 9, 2020Updated 5 years ago
- Bayesian Optimization with Density-Ratio Estimation☆24Dec 26, 2022Updated 3 years ago
- ☆22Jun 11, 2021Updated 4 years ago
- Random Mesh Projectors for Inverse Problems☆24Apr 13, 2021Updated 4 years ago
- TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL☆28Jul 14, 2025Updated 7 months ago
- ☆30Jun 19, 2023Updated 2 years ago
- Random feature latent variable models in Python☆23Jul 23, 2023Updated 2 years ago
- Efficient Conditionally Invariant Representation Learning (ICLR 2023, Oral)☆21Nov 27, 2022Updated 3 years ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆29Jun 4, 2024Updated last year
- Influence Estimation for Gradient-Boosted Decision Trees☆29May 27, 2024Updated last year
- ☆30Jun 3, 2022Updated 3 years ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆32Jan 7, 2026Updated last month
- Official code for paper LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning☆29Jun 11, 2021Updated 4 years ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆127Mar 30, 2024Updated last year
- ☆28Sep 13, 2021Updated 4 years ago
- The core repository of the elsciRL framework.☆18Dec 8, 2025Updated 2 months ago
- ☆35May 30, 2022Updated 3 years ago
- ☆35Jul 5, 2023Updated 2 years ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆137Jul 8, 2024Updated last year
- ☆35Oct 14, 2023Updated 2 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Code for the paper "SMACE: A New Method for the Interpretability of Composite Decision Systems", ECML 2022☆15Apr 17, 2023Updated 2 years ago