[ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.10480
☆18Jul 22, 2025Updated 8 months ago
Alternatives and similar repositories for D2PO
Users that are interested in D2PO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Jul 22, 2025Updated 8 months ago
- [NeurIPS 2025] The official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tun…☆40Feb 20, 2025Updated last year
- ☆12Jul 4, 2024Updated last year
- ☆12Dec 6, 2024Updated last year
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Code for "Thinking Forward: Memory-Efficient Federated Finetuning of Language Models" (NeurIPS 2024). Spry is a federated learning al…☆12Oct 8, 2024Updated last year
- [NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"☆53May 21, 2025Updated 10 months ago
- FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion (NeurIPS 2024 Spotlight)☆14Mar 31, 2025Updated last year
- ☆17May 29, 2022Updated 3 years ago
- ☆10Jul 5, 2024Updated last year
- Official repository of LIBERO-plus, a generalized benchmark for in-depth robustness analysis of vision-language-action models.☆272Jan 21, 2026Updated 2 months ago
- Repo for Llatrieval☆31Aug 21, 2024Updated last year
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆289Mar 21, 2026Updated 3 weeks ago
- Responsible Robotic Manipulation☆15Aug 31, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Collection of papers about video-audio understanding☆25Dec 26, 2025Updated 3 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- ☆22May 3, 2025Updated 11 months ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- ☆22Jan 26, 2024Updated 2 years ago
- Gradient-based Next-best-view Planning☆17Nov 20, 2024Updated last year
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆27Oct 13, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 复旦研究生入 学教育测试☆24Aug 28, 2025Updated 7 months ago
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Dec 12, 2025Updated 3 months ago
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆84Jun 17, 2024Updated last year
- [AAAI 2024] The Official implementation for 'SCD-Net: Spatiotemporal Clues Disentanglement Network for Self-supervised Skeleton-based Act…☆27Apr 20, 2024Updated last year
- Automatically check your GPA☆21Jun 17, 2025Updated 9 months ago
- Source codes and data for our IJCAI 2021 paper "Consistent Inference for Dialogue Relation Extraction".☆24Nov 27, 2021Updated 4 years ago
- ☆21Jan 17, 2025Updated last year
- Working note for WSI analysis☆10Apr 3, 2023Updated 3 years ago
- ☆24Jul 20, 2025Updated 8 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆55Nov 29, 2024Updated last year
- A novel Participation-Contributed Temporal Dynamic Model for Group Activity Recognition☆25Jan 24, 2021Updated 5 years ago
- ☆19Dec 23, 2024Updated last year
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆54Apr 3, 2026Updated last week
- Official implementation of the paper “Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning”☆20Aug 20, 2025Updated 7 months ago
- Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.☆209Apr 4, 2026Updated last week
- Opensource school volunteer management system for ZHZX.☆17Jul 31, 2024Updated last year