sinwang20/D2PO

[ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.10480

☆18

Alternatives and similar repositories for D2PO

Users who are interested in D2PO compare it with the repositories listed below.

xinghaow99 / prism
[ICML 2026] Prism: Spectral-Aware Block-Sparse Attention
☆27 · May 22, 2026 · Updated last month
inst-it / inst-it
[NeurIPS 2025] The official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tun…
☆40 · Feb 20, 2025 · Updated last year
baopj / Vid-Morp
☆12 · Dec 6, 2024 · Updated last year
RoboticSJTU / UniDomain
[NeurIPS 2025] Official implementation of "UniDomain: Pretraining a Unified PDDL Domain from Real-World Demonstrations for Generalizable …
☆22 · May 20, 2026 · Updated 2 months ago
baopj / E3M
[ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.
☆11 · Jul 16, 2024 · Updated 2 years ago
ljang0 / videowebarena
☆14 · Dec 25, 2024 · Updated last year
deep-learning-with-projects / deep-learning-with-projects
☆17 · May 29, 2022 · Updated 4 years ago
ytaek-oh / vl_compo
☆10 · Jul 5, 2024 · Updated 2 years ago
LaVi-Lab / FTTT
[ACL 2025] Official code for "Learning to Reason from Feedback at Test-Time".
☆13 · May 16, 2025 · Updated last year
HanSolo9682 / CounterCurate
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19 · Jun 27, 2024 · Updated 2 years ago
Phospheneser / Phospheneser-awesome-academic-template
An open-source personal academic homepage template characterized by its user-friendly design and extensive scalability.
☆37 · Oct 6, 2025 · Updated 9 months ago
BeastyZ / LLM-Verified-Retrieval
Repo for Llatrieval
☆32 · Aug 21, 2024 · Updated last year
kyle8581 / Web-Shepherd
[NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"
☆58 · May 21, 2025 · Updated last year
thunlp / NOSA
The official implementation of NOSA
☆19 · Jun 11, 2026 · Updated last month
liziniu / policy_optimization
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)
☆29 · Dec 19, 2023 · Updated 2 years ago
Lux0926 / ASPRM
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
☆10 · Mar 2, 2025 · Updated last year
hkust-nlp / GUIMid
☆22 · May 3, 2025 · Updated last year
EIT-NLP / Speak-While-Watching
☆17 · Mar 1, 2026 · Updated 4 months ago
TiankaiHang / CCA
☆22 · Jan 26, 2024 · Updated 2 years ago
Vancouver-wen / ibooking
A study room reservation system built with Django
☆10 · Nov 12, 2024 · Updated last year
EsYoon7 / RLHF-TLCR
[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
☆12 · Dec 6, 2024 · Updated last year
OpenMOSS / ABC-Bench
ABC-Bench is a benchmark for Agentic Backend Coding. It evaluates whether code agents can explore real repositories, edit code, configure…
☆33 · Jan 20, 2026 · Updated 6 months ago
dannyXSC / Fudan_FreshmanTest
Fudan University graduate student orientation test
☆23 · Aug 28, 2025 · Updated 10 months ago
MengLcool / SEGIC
[ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".
☆27 · Oct 13, 2024 · Updated last year
ShareLab-SII / CaTok
[CVPR-26] Official repository of "CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization"
☆19 · Mar 9, 2026 · Updated 4 months ago
kaist-ami / BEAF
[ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"
☆22 · Mar 26, 2025 · Updated last year
wdrink / OpenTokenizer
☆21 · Jan 17, 2025 · Updated last year
OpenMOSS / MOSS-Audio-Tokenizer
MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…
☆245 · Jun 16, 2026 · Updated last month
stegmuel / ScoreNet
Official implementation of "ScoreNet: Learning Non-Uniform Attention and Augmentation for Transformer-Based Histopathological Image Class…
☆12 · Mar 6, 2023 · Updated 3 years ago
WooooDyy / MathCritique
Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
☆55 · Nov 29, 2024 · Updated last year
xinwei96 / CoIn_dialogRE
Source codes and data for our IJCAI 2021 paper "Consistent Inference for Dialogue Relation Extraction".
☆24 · Nov 27, 2021 · Updated 4 years ago
w568w / AscendC-clangd-demo
A demo of developing Ascend operators with clangd.
☆15 · Sep 3, 2025 · Updated 10 months ago
QizaoWang / CAMC-CCReID
Co-Attention Aligned Mutual Cross-Attention for Cloth-Changing Person Re-Identification [ACCV 2022 Oral]
☆17 · Dec 26, 2024 · Updated last year
OpenMOSS / MOSS-VL
MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.
☆385 · Updated this week
SeanLeng1 / Reward-Calibration
☆21 · Dec 14, 2024 · Updated last year
ChenHsing / VIDiff
☆39 · Dec 4, 2023 · Updated 2 years ago
rookie-joe / AutoPSV
☆50 · Oct 28, 2024 · Updated last year
subrtadel / DIA
☆20 · Sep 13, 2023 · Updated 2 years ago
Qinying-Liu / Awesome-omni-modal-understanding
Collection of papers about video-audio understanding
☆25 · Dec 26, 2025 · Updated 6 months ago