sssth / awesome-DPOLinks
papers related to Direct Preference Optimization(DPO)
☆17Updated last year
Alternatives and similar repositories for awesome-DPO
Users that are interested in awesome-DPO are comparing it to the libraries listed below
Sorting:
- Awesome RL-based LLM Reasoning☆592Updated 3 weeks ago
- ☆52Updated 2 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆244Updated 2 months ago
- The latest progress of Personalized Large Language Models (LLMs).☆23Updated this week
- Paper list for Efficient Reasoning.☆586Updated this week
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆32Updated 5 months ago
- ☆151Updated 11 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆280Updated last month
- ☆255Updated last month
- Yelp Simulator for WWW'25 AgentSociety Challenge☆81Updated 3 months ago
- A list of awesome papers on LLM tool learning.☆25Updated last year
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆133Updated 3 weeks ago
- The awesome agents in the era of large language models☆68Updated last year
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆551Updated this week
- ☆105Updated 2 months ago
- ☆56Updated this week
- ☆310Updated 2 months ago
- ☆360Updated 4 months ago
- 🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…☆55Updated 2 months ago
- awesome papers in LLM interpretability☆530Updated 3 weeks ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆68Updated 4 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆470Updated 3 weeks ago
- Survey on LLM Agents (Published on CoLing 2025)☆358Updated 3 months ago
- A series of technical report on Slow Thinking with LLM☆715Updated 2 months ago
- Awesome RL Reasoning Recipes ("Triple R")☆768Updated last month
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆149Updated this week
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆282Updated last month
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆185Updated this week
- ☆545Updated 7 months ago
- Awesome Agent Training☆208Updated this week