A Survey of Direct Preference Optimization (DPO)
☆91Jul 4, 2025Updated 8 months ago
Alternatives and similar repositories for awesome-direct-preference-optimization
Users that are interested in awesome-direct-preference-optimization are comparing it to the libraries listed below
Sorting:
- Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers [https://arxiv.org/pdf/2112.04934.pdf]☆15May 13, 2023Updated 2 years ago
- Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks☆17Jan 15, 2025Updated last year
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆21Jan 24, 2026Updated last month
- Official codebase for paper Disentangled Condensation for Graphs (DisCo). This codebase is based on the open-source Pytorch Geometric fra…