A Survey of Direct Preference Optimization (DPO)
☆97Jul 4, 2025Updated 10 months ago
Alternatives and similar repositories for awesome-direct-preference-optimization
Users who are interested in awesome-direct-preference-optimization are comparing it to the libraries listed below.
- Transformer Doctor: Diagnosing and Treating Vision Transformers☆11Jan 15, 2025Updated last year
- Odyssey: Empowering Minecraft Agents with Open-World Skills☆384Oct 22, 2025Updated 7 months ago
- Official codebase for paper Disentangled Condensation for Graphs (DisCo). This codebase is based on the open-source Pytorch Geometric fra…☆11Feb 12, 2025Updated last year
- The official implementation of Spatiotemporal Gated Traffic Trajectory Simulation with Semantic-aware Graph Learning (Information Fusion …☆10May 6, 2024Updated 2 years ago
- The official implementation of 'Spatiotemporal-Augmented Graph Neural Networks for Human Mobility Simulation'.☆15Nov 2, 2024Updated last year
- [AAAI 2025] Holistic Semantic Representation for Navigational Trajectory Generation☆19Mar 7, 2026Updated 2 months ago
- Simple Graph Condensation☆13Feb 26, 2025Updated last year
- A strong baseline result on the CIFAR100 dataset☆20Nov 23, 2025Updated 6 months ago
- ☆32Oct 4, 2025Updated 7 months ago
- [IEEE Transactions on Power Systems] Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-…☆25Jun 2, 2024Updated last year
- Official PyTorch implementation of paper "Schema Inference for Interpretable Image Classification" (ICLR 2023)☆15Apr 6, 2023Updated 3 years ago
- [AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition☆39Jun 3, 2024Updated last year
- [TPAMI] Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning☆33May 17, 2024Updated 2 years ago
- [ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation☆11Aug 13, 2024Updated last year
- Cut2Next: Generating Next Shot via In-Context Tuning☆33Aug 21, 2025Updated 9 months ago
- A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges☆276Feb 10, 2025Updated last year
- ☆22Oct 26, 2023Updated 2 years ago
- [IEEE Transactions on Intelligent Transportation Systems] Curricular Subgoal for Inverse Reinforcement Learning☆18Jul 31, 2023Updated 2 years ago
- Papers related to Direct Preference Optimization (DPO)☆20Jul 16, 2024Updated last year
- A tool for processing human 3D models, including model visualization, model fitting, etc.☆14Mar 14, 2022Updated 4 years ago
- Awesome lists about all kinds of awesome skills to help you go out of 35 crisis, and most important, to tell you how to enjoy your life.☆19Jul 9, 2022Updated 3 years ago
- [AAAI 2025] OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving☆22Dec 24, 2024Updated last year
- This is a collection of resources on AI-AR-ART generation.☆28Dec 14, 2022Updated 3 years ago
- Poaps for FutureX☆10Jul 28, 2023Updated 2 years ago
- [AAAI 2025 Oral] GURecon: Learning Detailed 3D Geometric Uncertainties for Neural Surface Reconstruction☆23Jul 21, 2025Updated 10 months ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆17Sep 2, 2024Updated last year
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 9 months ago
- SIGGRAPH 2022 Labs Demo for "Learning Smooth Neural Functions via Lipschitz Regularization"☆30Aug 5, 2022Updated 3 years ago
- PyTorch Implementation of Query-Aware Sequential Recommendation (CIKM'22)☆13Sep 28, 2022Updated 3 years ago
- Comprehensive Benchmark Dataset for Dynamic Text-Attributed Graphs☆50Nov 6, 2024Updated last year
- A full-text error corrector for English based on transformers and deep learning☆10Jan 8, 2023Updated 3 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- ☆11Jul 30, 2025Updated 9 months ago
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat…☆34Aug 23, 2025Updated 9 months ago
- Code related to 'Beyond spectral gap: The role of the topology in decentralized learning'.☆14Jun 7, 2022Updated 3 years ago
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…☆12Jul 9, 2025Updated 10 months ago
- Advances on machine learning of graphs, covering the reading list of recent top academic conferences.☆237May 12, 2026Updated 2 weeks ago
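- This project presents a categorization of selected papers from top information retrieval and data mining conferences in 2022.☆17Jun 13, 2022Updated 3 years ago
- NUST-API collection☆10Oct 29, 2018Updated 7 years ago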