[ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.10480
☆18Jul 22, 2025Updated 10 months ago
Alternatives and similar repositories for D2PO
Users that are interested in D2PO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Jul 22, 2025Updated 10 months ago
- [COLING 2025 Industry] LoRA Soups☆20Nov 29, 2024Updated last year
- ☆12Jul 4, 2024Updated last year
- ☆23May 5, 2026Updated last month
- ☆12Dec 6, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- [NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"☆56May 21, 2025Updated last year
- ☆14Dec 25, 2024Updated last year
- An open-source personal academic homepage template characterized by its user-friendly design and extensive scalability.☆37Oct 6, 2025Updated 8 months ago
- FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion (NeurIPS 2024 Spotlight)☆15Mar 31, 2025Updated last year
- [ACL 2025] Official code for ''Learning to Reason from Feedback at Test-Time''.☆13May 16, 2025Updated last year
- Code for "Thinking Forward: Memory-Efficient Federated Finetuning of Language Models" (NeurIPS 2024). Spry is a federated learning al…☆12Oct 8, 2024Updated last year
- A Light-Weight And Interpretable Molecular Docking Model☆26Oct 23, 2024Updated last year
- ☆17May 29, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Advanced Embodied Intelligence Brain Model☆36Nov 5, 2025Updated 7 months ago
- [ICCV 2025] AdsQA: Towards Advertisement Video Understanding Arxiv: https://arxiv.org/abs/2509.08621☆35Oct 30, 2025Updated 7 months ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆305Jun 3, 2026Updated last week
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- ☆22May 3, 2025Updated last year
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- PICABench: How Far Are We from Physically Realistic Image Editing?☆38Nov 5, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- ☆22Jan 26, 2024Updated 2 years ago
- Gradient-based Next-best-view Planning☆18Nov 20, 2024Updated last year
- 复旦研究生入学教育测试☆23Aug 28, 2025Updated 9 months ago
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆89Jun 17, 2024Updated last year
- MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…☆224May 6, 2026Updated last month
- [AAAI 2024] The Official implementation for 'SCD-Net: Spatiotemporal Clues Disentanglement Network for Self-supervised Skeleton-based Act…☆27Apr 20, 2024Updated 2 years ago
- ☆33Jul 10, 2025Updated 11 months ago
- ☆47Dec 9, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆26Sep 13, 2024Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆55Nov 29, 2024Updated last year
- A novel Participation-Contributed Temporal Dynamic Model for Group Activity Recognition☆25Jan 24, 2021Updated 5 years ago
- [NAACL 2025] SIUO: Cross-Modality Safety Alignment☆125Jan 31, 2025Updated last year
- ☆19Dec 23, 2024Updated last year
- 《파이토치 트랜스포머를 활용한 자연어 처리와 컴퓨터비전 심층학습》 예제 코드☆45Feb 16, 2025Updated last year
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆35Jun 23, 2025Updated 11 months ago