A Survey of Direct Preference Optimization (DPO)
☆95Jul 4, 2025Updated 10 months ago
Alternatives and similar repositories for awesome-direct-preference-optimization
Users that are interested in awesome-direct-preference-optimization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers [https://arxiv.org/pdf/2112.04934.pdf]☆15May 13, 2023Updated 2 years ago
- Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks☆17Jan 15, 2025Updated last year
- Official codebase for paper Disentangled Condensation for Graphs (DisCo). This codebase is based on the open-source Pytorch Geometric fra…☆11Feb 12, 2025Updated last year
- The official implementation of Spatiotemporal Gated Traffic Trajectory Simulation with Semantic-aware Graph Learning (Information Fusion …☆10May 6, 2024Updated 2 years ago
- The official implementation of 'Spatiotemporal-Augmented Graph Neural Networks for Human Mobility Simulation'.☆15Nov 2, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [SIGKDD' 24] PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks☆14Jul 28, 2024Updated last year
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆219Apr 27, 2026Updated last week
- 一个CIFAR100数据集的强基线结果☆20Nov 23, 2025Updated 5 months ago
- ☆32Oct 4, 2025Updated 7 months ago
- [IEEE Transactions on Power Systems] Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-…☆25Jun 2, 2024Updated last year
- [AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition☆39Jun 3, 2024Updated last year
- [TPAMI] Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning☆33May 17, 2024Updated last year
- ☆219Jun 17, 2025Updated 10 months ago
- [ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation☆11Aug 13, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Cut2Next: Generating Next Shot via In-Context Tuning☆32Aug 21, 2025Updated 8 months ago
- This is a list of awesome prototype-based papers for explainable artificial intelligence.☆41Dec 12, 2022Updated 3 years ago
- RFTT: Reasoning with Reinforced Functional Token Tuning☆29Feb 12, 2026Updated 2 months ago
- A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges☆274Feb 10, 2025Updated last year
- Streaming Graph Server with partitioning☆15Aug 17, 2023Updated 2 years ago
- ☆22Oct 26, 2023Updated 2 years ago
- [IEEE Transactions on Intelligent Transportation Systems] Curricular Subgoal for Inverse Reinforcement Learning☆18Jul 31, 2023Updated 2 years ago
- A survey for visual generation alignment☆138Nov 9, 2025Updated 5 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆27Jul 9, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- a tools to process human 3D model, such as model visualization, model fitting, etc.☆14Mar 14, 2022Updated 4 years ago
- The dataset and code for PeerSum at EMNLP'23.☆16Oct 20, 2025Updated 6 months ago
- [AAAI 2025 Oral] GURecon: Learning Detailed 3D Geometric Uncertainties for Neural Surface Reconstruction☆23Jul 21, 2025Updated 9 months ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated last year
- ☆17Nov 18, 2024Updated last year
- SIGGRAPH 2022 Labs Demo for "Learning Smooth Neural Functions via Lipschitz Regularization"☆30Aug 5, 2022Updated 3 years ago
- Comprehensive Benchmark Dataset for Dynamic Text-Attributed Graphs☆49Nov 6, 2024Updated last year
- 求是潮网站后端开发入门☆15Oct 10, 2014Updated 11 years ago
- search-rattailcollagen1 created by GitHub Classroom☆10Jan 17, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 收集和梳理病理AI大模型相关☆23Oct 17, 2025Updated 6 months ago
- A full-text error corrector for English based on transformers and deep learning☆10Jan 8, 2023Updated 3 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- ☆11Jul 30, 2025Updated 9 months ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆11Dec 30, 2024Updated last year
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat…☆34Aug 23, 2025Updated 8 months ago
- ☆13Jan 12, 2024Updated 2 years ago