liushunyu/awesome-direct-preference-optimization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/liushunyu/awesome-direct-preference-optimization)

liushunyu / awesome-direct-preference-optimization

A Survey of Direct Preference Optimization (DPO)

☆95

Alternatives and similar repositories for awesome-direct-preference-optimization

Users that are interested in awesome-direct-preference-optimization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jiaconghu / Model-Doctor
View on GitHub
Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers [https://arxiv.org/pdf/2112.04934.pdf]
☆15May 13, 2023Updated 3 years ago
jiaconghu / Model-LEGO
View on GitHub
Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks
☆17Jan 15, 2025Updated last year
jiaconghu / Transformer-Doctor
View on GitHub
Transformer Doctor: Diagnosing and Treating Vision Transformers
☆11Jan 15, 2025Updated last year
zju-vipa / Odyssey
View on GitHub
Odyssey: Empowering Minecraft Agents with Open-World Skills
☆396Oct 22, 2025Updated 9 months ago
wantbook-book / SeRL
View on GitHub
SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
☆24Jan 24, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
BangHonor / DisCo
View on GitHub
Official codebase for paper Disentangled Condensation for Graphs (DisCo). This codebase is based on the open-source Pytorch Geometric fra…
☆11Feb 12, 2025Updated last year
Star607 / STEGA
View on GitHub
The official implementation of Spatiotemporal Gated Traffic Trajectory Simulation with Semantic-aware Graph Learning (Information Fusion …
☆10May 6, 2024Updated 2 years ago
MaybeLizzy / PERMU
View on GitHub
☆34Oct 4, 2025Updated 9 months ago
Star607 / Cross-city-Mobility-Transformer
View on GitHub
The official implementation of "COLA: Cross-city Mobility Transformer for Human Trajectory Simulation".
☆22Jan 31, 2026Updated 5 months ago
tanganke / fusion_bench
View on GitHub
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
☆235Jun 23, 2026Updated last month
BangHonor / SimGC
View on GitHub
Simple Graph Condensation
☆13Feb 26, 2025Updated last year
zhfeing / SchemaNet-PyTorch
View on GitHub
Official PyTorch implementation of paper "Schema Inference for Interpretable Image Classification" (ICLR 2023)
☆16Apr 6, 2023Updated 3 years ago
sastpg / RFTT
View on GitHub
RFTT: Reasoning with Reinforced Functional Token Tuning
☆29Feb 12, 2026Updated 5 months ago
OATML-Markslab / Protriever
View on GitHub
Official repository for the Protriever paper
☆17Jun 5, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WANGBohaO-jpg / GDRT
View on GitHub
[WWW2026] The official code for paper "Does LLM Focus on the Right Words? Mitigating Context Bias in LLM-based Recommenders"
☆23Jan 23, 2026Updated 6 months ago
Arking1995 / COHO
View on GitHub
[ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation
☆13Aug 13, 2024Updated last year
tcmyxc / Strong-Baselines-for-CIFAR100
View on GitHub
一个CIFAR100数据集的强基线结果
☆20Nov 23, 2025Updated 8 months ago
tmkasun / streaming_graph_partitioning
View on GitHub
Streaming Graph Server with partitioning
☆15Aug 17, 2023Updated 2 years ago
nju-websoft / CtrlProt
View on GitHub
Controllable Protein Sequence Generation with LLM Preference Optimization, AAAI 2025
☆17Mar 10, 2025Updated last year
sssth / awesome-DPO
View on GitHub
papers related to Direct Preference Optimization（DPO）
☆20Jul 16, 2024Updated 2 years ago
Gleghorn-Lab / DSM
View on GitHub
Protein representation and design under a single training scheme
☆24May 17, 2026Updated 2 months ago
SophieZheng998 / ALI-Agent
View on GitHub
Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"
☆21Jan 31, 2026Updated 5 months ago
pritamqu / HALVA
View on GitHub
[ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination
☆21Jan 27, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
luli-git / MAP
View on GitHub
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
☆18Sep 2, 2024Updated last year
gersteinlab / BC-Design
View on GitHub
BC-Design: A Biochemistry-Aware Framework for High-Precision Inverse Protein Folding https://www.biorxiv.org/content/10.1101/2024.10.28.6…
☆22Nov 24, 2025Updated 8 months ago
ChangyuChen347 / MaskedThought
View on GitHub
[ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
☆27Jul 9, 2024Updated 2 years ago
sastpg / CoVo
View on GitHub
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning
☆25Jun 25, 2025Updated last year
ai4protein / Venus-MAXWELL
View on GitHub
Source code of Venus-MAXWELL: Efficient Learning of Protein-Mutation Stability Landscapes using Protein Language Models
☆25Jun 3, 2025Updated last year
qitianwu / AdvDIFFormer
View on GitHub
The official implementation for ICML2025 paper "Supercharging Graph Transformers with Advective Diffusion"
☆15Jul 2, 2025Updated last year
oaimli / PeerSum
View on GitHub
The dataset and code for PeerSum at EMNLP'23.
☆16Oct 20, 2025Updated 9 months ago
OpenGSL / OpenGSL
View on GitHub
☆183Jan 8, 2025Updated last year
Bitbol-Lab / rag-esm
View on GitHub
RAG-ESM is a retrieval-augmented framework that allows to condition pretrained ESM2 protein language models on homologous sequences
☆27Aug 21, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mitmedialab / empathic-stories
View on GitHub
☆17Nov 18, 2024Updated last year
yanty123 / OLiDM
View on GitHub
[AAAI 2025] OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving
☆22Dec 24, 2024Updated last year
opendatalab / HA-DPO
View on GitHub
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
☆104Jan 30, 2024Updated 2 years ago
SuburbiaXX / NUAA-OS
View on GitHub
NUAA操作系统实验课
☆10Jun 23, 2023Updated 3 years ago
jozhang97 / ambient-proteins
View on GitHub
Official code release for Ambient Protein Diffusion
☆35Aug 30, 2025Updated 10 months ago
GigaAI-research / WonderFree
View on GitHub
☆19Jun 26, 2025Updated last year
laihuiyuan / multilingual-tst
View on GitHub
Multilingual Pre-training with Language and Task Adaptation for Multilingual Text Style Transfer (ACL 2022)
☆10Sep 22, 2022Updated 3 years ago