A Survey of Direct Preference Optimization (DPO)
☆97Jul 4, 2025Updated 11 months ago
Alternatives and similar repositories for awesome-direct-preference-optimization
Users that are interested in awesome-direct-preference-optimization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks☆17Jan 15, 2025Updated last year
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆23Jan 24, 2026Updated 4 months ago
- Official codebase for paper Disentangled Condensation for Graphs (DisCo). This codebase is based on the open-source Pytorch Geometric fra…☆11Feb 12, 2025Updated last year
- The official implementation of 'Spatiotemporal-Augmented Graph Neural Networks for Human Mobility Simulation'.☆15Nov 2, 2024Updated last year
- [AAAI 2025] Holistic Semantic Representation for Navigational Trajectory Generation☆19Mar 7, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICML 2023] Decentralized SGD and Average-direction SAM are Asymptotically Equivalent☆20Dec 4, 2023Updated 2 years ago
- [SIGKDD' 24] PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks☆14Jul 28, 2024Updated last year
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆229Updated this week
- Simple Graph Condensation☆13Feb 26, 2025Updated last year
- ☆32Oct 4, 2025Updated 8 months ago
- Softened ROSA QKV Operators for Training Next-Generation LLM Models☆38Apr 7, 2026Updated 2 months ago
- [IEEE Transactions on Power Systems] Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-…☆25Jun 2, 2024Updated 2 years ago
- Official PyTorch implementation of paper "Schema Inference for Interpretable Image Classification" (ICLR 2023)☆15Apr 6, 2023Updated 3 years ago
- [TPAMI] Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning☆33May 17, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆217Jun 17, 2025Updated 11 months ago
- Cut2Next: Generating Next Shot via In-Context Tuning☆33Aug 21, 2025Updated 9 months ago
- This is a list of awesome prototype-based papers for explainable artificial intelligence.☆42Dec 12, 2022Updated 3 years ago
- A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges☆281Feb 10, 2025Updated last year
- a tools to process human 3D model, such as model visualization, model fitting, etc.☆14Mar 14, 2022Updated 4 years ago
- A survey for visual generation alignment☆142Nov 9, 2025Updated 7 months ago
- Awesome lists about all kinds of awesome skills to help you go out of 35 crisis, and most important, to tell you how to enjoy your life.☆19Jul 9, 2022Updated 3 years ago
- [AAAI 2025] OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving☆22Dec 24, 2024Updated last year
- [AAAI 2025 Oral] GURecon: Learning Detailed 3D Geometric Uncertainties for Neural Surface Reconstruction☆23Jul 21, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 10 months ago
- This repository contains the code and pre-trained models for our paper☆24Jun 29, 2025Updated 11 months ago
- Multilingual Pre-training with Language and Task Adaptation for Multilingual Text Style Transfer (ACL 2022)☆10Sep 22, 2022Updated 3 years ago
- SIGGRAPH 2022 Labs Demo for "Learning Smooth Neural Functions via Lipschitz Regularization"☆30Aug 5, 2022Updated 3 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat…☆35Aug 23, 2025Updated 9 months ago
- Code related to ’Beyond spectral gap: The role of the topology in decentralized learning‘.☆14Jun 7, 2022Updated 4 years ago
- ☆14Jun 11, 2023Updated 3 years ago
- Documentation at☆14Mar 27, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- model based reinforcement learning algorithms for unstable baselines☆15May 9, 2023Updated 3 years ago
- Advances on machine learning of graphs, covering the reading list of recent top academic conferences.☆243May 28, 2026Updated 2 weeks ago
- 2023 Gaussian Splatting Paper List(Arxiv)☆21Jan 11, 2024Updated 2 years ago
- Protein representation and design under a single training scheme☆24May 17, 2026Updated 3 weeks ago
- ManifoldNet Paper Implementation for SPD(n)☆11Nov 10, 2021Updated 4 years ago
- ☆17Jun 10, 2025Updated last year
- ☆181Jan 8, 2025Updated last year