general-preference / general-preference-model

[ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)

☆29

Alternatives and similar repositories for general-preference-model

Users who are interested in general-preference-model are comparing it to the libraries listed below.

GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆36 Updated last year
RenzeLou / AAAR-1.0
The source code for running LLMs on the AAAR-1.0 benchmark.
☆17 Updated 6 months ago
mathllm / Step-Controlled_DPO
☆22 Updated last year
googleinterns / localizing-paragraph-memorization
☆15 Updated last year
RUCAIBox / RLMEC
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
☆38 Updated last year
chujiezheng / LLM-Extrapolation
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆75 Updated 4 months ago
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆51 Updated 4 months ago
HKUNLP / STRING
[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
☆77 Updated 10 months ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆35 Updated last year
sail-sg / dice
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
☆44 Updated 5 months ago
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆47 Updated 7 months ago
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58 Updated last year
gl-ybnbxb / BoNBoN
☆18 Updated last year
GAIR-NLP / ReasonEval
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
☆69 Updated this week
yiqingxyq / RepoST
Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"
☆22 Updated 6 months ago
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆30 Updated 2 months ago
yayayacc / MUR
☆45 Updated last week
ChengpengLi1003 / DotaMath
☆30 Updated 9 months ago
RLHFlow / Directional-Preference-Alignment
Directional Preference Alignment
☆57 Updated last year
TianduoWang / DPO-ST
[ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
☆51 Updated last year
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆63 Updated last year
Re-Align / AlignTDS
Analyzing LLM Alignment via Token distribution shift
☆17 Updated last year
GAIR-NLP / benbench
Benchmarking Benchmark Leakage in Large Language Models
☆55 Updated last year
liziniu / GEM
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)
☆41 Updated 5 months ago
icip-cas / SSO
A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…
☆20 Updated 10 months ago
chtmp223 / suri
Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)
☆25 Updated last week
satrams / rent-rl
RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.
☆40 Updated 3 months ago
wzq016 / PINE
Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"
☆17 Updated 4 months ago
dqxiu / KAssess
☆14 Updated last year
RUCAIBox / JiuZhang3.0
The code and data for the paper JiuZhang3.0
☆49 Updated last year