UMass-Embodied-AGI / TesserActLinks
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
☆371Updated 5 months ago
Alternatives and similar repositories for TesserAct
Users that are interested in TesserAct are comparing it to the libraries listed below
Sorting:
- [CORL 2025 Oral]One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation.☆439Updated 5 months ago
- Ctrl-World: A Controllable Generative World Model for Robot Manipualtion☆244Updated last month
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆216Updated 2 months ago
- Towards a Generative 3D World Engine for Embodied Intelligence☆380Updated this week
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆260Updated 9 months ago
- VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos☆248Updated 3 weeks ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆171Updated 6 months ago
- 🌐 3D and 4D World Modeling: A Survey☆760Updated 3 weeks ago
- PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation☆157Updated last week
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling☆556Updated 2 months ago
- This is the repository that contains source code for the PhysGen3D.☆237Updated 4 months ago
- ☆183Updated 5 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆314Updated 5 months ago
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆364Updated 2 months ago
- Official implementation of "Next-Scale Autoregressive Models are Zero-Shot Single-Image Object View Synthesizers"☆46Updated 9 months ago
- Codebase for Automated Creation of Digital Cousins for Robust Policy Learning☆238Updated 9 months ago
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning☆242Updated last month
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆121Updated 2 months ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation☆161Updated 7 months ago
- ☆153Updated last year
- ☆113Updated last week
- A Comprehensive Survey on World Models for Embodied AI☆184Updated 2 months ago
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆273Updated last week
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆40Updated 3 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆420Updated last week
- Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control in…☆331Updated last week
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆409Updated last week
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆174Updated 6 months ago
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model☆159Updated last week
- WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine…☆130Updated last week