liushunyu / awesome-direct-preference-optimization
A Survey of Direct Preference Optimization (DPO)
☆35Updated last month
Alternatives and similar repositories for awesome-direct-preference-optimization:
Users that are interested in awesome-direct-preference-optimization are comparing it to the libraries listed below
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆121Updated this week
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆29Updated last month
- Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers [https://arxiv.org/pdf/2112.04934.pdf]☆15Updated last year
- papers related to Direct Preference Optimization(DPO)☆18Updated 9 months ago
- Transformer Doctor: Diagnosing and Treating Vision Transformers☆10Updated 3 months ago
- Accepted LLM Papers in NeurIPS 2024☆35Updated 6 months ago
- Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks☆16Updated 3 months ago
- ☆90Updated 3 months ago
- Official repository of "Can Language Models Solve Graph Problems in Natural Language?". NeurIPS 2023 (Spotlight)☆122Updated 8 months ago
- Yelp Simulator for WWW'25 AgentSociety Challenge☆74Updated 3 weeks ago
- ☆144Updated 7 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond☆191Updated this week
- [NeurIPS 2024] GITA: Graph to Image-Text Integration for Vision-Language Graph Reasoning☆49Updated 5 months ago
- [AAAI 2025] Holistic Semantic Representation for Navigational Trajectory Generation☆11Updated last week
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆156Updated 8 months ago
- ☆33Updated 2 weeks ago
- ☆91Updated 2 weeks ago
- ☆29Updated 6 months ago
- ☆28Updated 10 months ago
- Awesome RL-based LLM Reasoning☆450Updated last week
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆190Updated last week
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆59Updated last week
- Survey on Data-centric Large Language Models☆83Updated 9 months ago
- Simple Graph Condensation☆11Updated 2 months ago
- Paper List of Inference/Test Time Scaling/Computing☆195Updated this week
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆188Updated 4 months ago
- [NeurIPS 2024] Official implementation for paper "Can Graph Learning Improve Planning in LLM-based Agents?"☆120Updated 4 months ago
- Easily download anonymous Github repositories from https://anonymous.4open.science/ with a GUI interface☆96Updated 11 months ago
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆98Updated 9 months ago
- This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)☆46Updated 2 months ago