A Survey of Direct Preference Optimization (DPO)
☆90Jul 4, 2025Updated 9 months ago
Alternatives and similar repositories for awesome-direct-preference-optimization
Users that are interested in awesome-direct-preference-optimization are comparing it to the libraries listed below.
- Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks☆17Jan 15, 2025Updated last year
- Transformer Doctor: Diagnosing and Treating Vision Transformers☆11Jan 15, 2025Updated last year
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆22Jan 24, 2026Updated 2 months ago
- Official codebase for paper Disentangled Condensation for Graphs (DisCo). This codebase is based on the open-source Pytorch Geometric fra…☆11Feb 12, 2025Updated last year
- The official implementation of Spatiotemporal Gated Traffic Trajectory Simulation with Semantic-aware Graph Learning (Information Fusion …☆10May 6, 2024Updated last year
- [AAAI 2025] Holistic Semantic Representation for Navigational Trajectory Generation☆18Mar 7, 2026Updated last month
- [ICML 2023] Decentralized SGD and Average-direction SAM are Asymptotically Equivalent☆19Dec 4, 2023Updated 2 years ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆209Mar 31, 2026Updated 2 weeks ago
- Simple Graph Condensation☆13Feb 26, 2025Updated last year
- A strong baseline result on the CIFAR100 dataset☆20Nov 23, 2025Updated 4 months ago
- ☆32Oct 4, 2025Updated 6 months ago
- Softened ROSA QKV Operators for Training Next-Generation LLM Models☆36Apr 7, 2026Updated last week
- [IEEE Transactions on Power Systems] Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-…☆25Jun 2, 2024Updated last year
- [AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition☆39Jun 3, 2024Updated last year
- ☆220Jun 17, 2025Updated 9 months ago
- Cut2Next: Generating Next Shot via In-Context Tuning☆31Aug 21, 2025Updated 7 months ago
- This is a list of awesome prototype-based papers for explainable artificial intelligence.☆41Dec 12, 2022Updated 3 years ago
- RFTT: Reasoning with Reinforced Functional Token Tuning☆29Feb 12, 2026Updated 2 months ago
- A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges☆273Feb 10, 2025Updated last year
- Streaming Graph Server with partitioning☆15Aug 17, 2023Updated 2 years ago
- [IEEE Transactions on Intelligent Transportation Systems] Curricular Subgoal for Inverse Reinforcement Learning☆17Jul 31, 2023Updated 2 years ago
- ☆27Jun 2, 2025Updated 10 months ago
- A survey for visual generation alignment☆133Nov 9, 2025Updated 5 months ago
- A tool for processing human 3D models, such as model visualization, model fitting, etc.☆14Mar 14, 2022Updated 4 years ago
- Awesome lists about all kinds of awesome skills to help you get out of the 35 crisis and, most importantly, to tell you how to enjoy your life.☆19Jul 9, 2022Updated 3 years ago
- The dataset and code for PeerSum at EMNLP'23.☆16Oct 20, 2025Updated 5 months ago
- [AAAI 2025] OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving☆22Dec 24, 2024Updated last year
- [AAAI 2025 Oral] GURecon: Learning Detailed 3D Geometric Uncertainties for Neural Surface Reconstruction☆23Jul 21, 2025Updated 8 months ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 8 months ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆16Sep 2, 2024Updated last year
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated last year
- The official implementation of the paper "Topology-aware Generalization of Decentralized SGD"☆37Mar 29, 2023Updated 3 years ago
- This repository contains the code and pre-trained models for our paper☆23Jun 29, 2025Updated 9 months ago
- ☆18Mar 30, 2025Updated last year
- SIGGRAPH 2022 Labs Demo for "Learning Smooth Neural Functions via Lipschitz Regularization"☆30Aug 5, 2022Updated 3 years ago
- Comprehensive Benchmark Dataset for Dynamic Text-Attributed Graphs☆49Nov 6, 2024Updated last year
- Pytorch-ImageSegmentation☆10Nov 7, 2019Updated 6 years ago
- A collection and organization of resources related to large AI models for pathology☆22Oct 17, 2025Updated 5 months ago
- A full-text error corrector for English based on transformers and deep learning☆10Jan 8, 2023Updated 3 years ago