buoyancy99 / large-video-planner
☆25 Updated this week
Alternatives and similar repositories for large-video-planner
Users who are interested in large-video-planner are comparing it to the repositories listed below.
- Implementation of Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins. [RSS 2025] ☆45 Updated 2 months ago
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning ☆90 Updated last year
- Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation" ☆67 Updated 2 weeks ago
- View-Invariant Policy Learning via Zero-Shot Novel View Synthesis (CoRL 2024) ☆26 Updated 2 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation ☆91 Updated 6 months ago
- ☆43 Updated 5 months ago
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation ☆40 Updated 6 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆171 Updated 6 months ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream… ☆28 Updated 2 weeks ago
- Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting ☆40 Updated last year
- [CoRL 2025] Robot Learning from Any Images ☆34 Updated last month
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts. ☆204 Updated 2 months ago
- [ICCV'25] Towards Scalable Gaussian World Models for Robotic Manipulation ☆65 Updated 2 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and… ☆60 Updated 8 months ago
- ☆137 Updated 8 months ago
- Mirage: a zero-shot cross-embodiment policy transfer method. Benchmarking code for cross-embodiment policy transfer. ☆31 Updated last year
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning" ☆115 Updated last month
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration ☆58 Updated 7 months ago
- [CVPR 2024] GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction ☆76 Updated 5 months ago
- ☆21 Updated 7 months ago
- ☆86 Updated 3 months ago
- Official implementation of "Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation" ☆127 Updated 3 months ago
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24). ☆45 Updated 6 months ago
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model ☆145 Updated 2 weeks ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆79 Updated last year
- Official Implementation for “CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World” (RSS 2025). ☆40 Updated 3 weeks ago
- Official Implementation of ARM4R ICML 2025 ☆52 Updated 3 months ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction ☆42 Updated 3 months ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding. ☆52 Updated last month
- ☆65 Updated 5 months ago