shenao-zhang / reward-augmented-preferenceLinks

The official implementation of Preference Data Reward-Augmentation.

☆18

Alternatives and similar repositories for reward-augmented-preference

Users that are interested in reward-augmented-preference are comparing it to the libraries listed below

Sorting:

ALT-JS / OthelloSAE
CS194-196 Course Project
☆15Updated 9 months ago
TianheL / LM-Implicit-Reasoning
[ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts
☆16Updated 8 months ago
LAMDASZ-ML / Self-Backtracking
☆51Updated 9 months ago
yale-nlp / refdpo
☆16Updated last year
Geaming2002 / Ruler
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
☆39Updated last year
dinobby / MAgICoRE
☆24Updated last year
Hritikbansal / sparse_feedback
☆29Updated last year
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆122Updated last year
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆44Updated last month
WujiangXu / EPO
The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"
☆33Updated last month
yuleiqin / RAIF
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆28Updated last month
tianyi-lab / C3PO
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆18Updated 7 months ago
MLLM-Data-Contamination / MM-Detect
This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"
☆16Updated last month
uclaml / COPS
The official implementation of Cross-Task Experience Sharing (COPS)
☆30Updated last year
shulin16 / MMInA
[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
☆47Updated 8 months ago
David-Li0406 / Preference-Leakage
☆51Updated 5 months ago
xufangzhi / Genius
[ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework
☆71Updated 5 months ago
F2-Song / Weak-to-Strong-Decoding
The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"
☆23Updated 4 months ago
UKPLab / arxiv2025-inherent-limits-plms
Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…
☆13Updated 10 months ago
SLIT-AI / WRPO
[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion
☆13Updated 8 months ago
NiuTrans / ForgettingCurve
A benchmark for testing memorization abilities of LMs
☆20Updated last year
Zoeyyao27 / SirLLM
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆60Updated last year
limenlp / safer-instruct
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Updated last year
nick7nlp / FastCuRL
FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning
☆53Updated last month
metal-chart-generation / metal
☆40Updated 5 months ago
open-compass / Ada-LEval
The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"
☆55Updated 6 months ago
cxcscmu / Montessori-Instruct
Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]
☆48Updated 9 months ago
rohinmanvi / Capability-Aware-and-Mid-Generation-Self-Evaluations
☆21Updated 3 months ago
YutongWang1216 / DocMTAgent
Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
☆54Updated 9 months ago
tianyi-lab / R2-T2
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆16Updated 8 months ago