UMass-Embodied-AGI / TesserAct
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
☆379 Updated 6 months ago
Alternatives and similar repositories for TesserAct
Users who are interested in TesserAct are comparing it to the repositories listed below.
- [CoRL 2025 Oral] One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation. ☆445 Updated 5 months ago
- PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation ☆342 Updated 3 weeks ago
- ☆178 Updated last week
- Ctrl-World: A Controllable Generative World Model for Robot Manipulation ☆259 Updated 2 months ago
- VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos ☆285 Updated last week
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation ☆262 Updated 10 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts. ☆219 Updated 3 months ago
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning ☆257 Updated 3 weeks ago
- Causal video-action world model for generalist robot control ☆289 Updated this week
- This is the repository that contains source code for the PhysGen3D. ☆240 Updated 4 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling ☆569 Updated 3 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆173 Updated 7 months ago
- 🌐 3D and 4D World Modeling: A Survey ☆793 Updated 2 weeks ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling ☆420 Updated 3 weeks ago
- Towards a Generative 3D World Engine for Embodied Intelligence ☆385 Updated last week
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025) ☆325 Updated 6 months ago
- ☆183 Updated 6 months ago
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model ☆175 Updated 3 weeks ago
- [ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields ☆467 Updated 3 months ago
- ☆224 Updated 4 months ago
- Codebase for Automated Creation of Digital Cousins for Robust Policy Learning ☆238 Updated 10 months ago
- Official implementation of "Next-Scale Autoregressive Models are Zero-Shot Single-Image Object View Synthesizers" ☆45 Updated 10 months ago
- ☆153 Updated last year
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning" ☆125 Updated 3 months ago
- [Official] AstraNav-Memory: Contexts Compression for Long Memory. An image-centric memory framework for lifelong embodied navigation via … ☆19 Updated 2 weeks ago
- Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the … ☆761 Updated last week
- Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control in… ☆422 Updated last week
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge ☆282 Updated 3 weeks ago
- Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets ☆184 Updated 3 months ago
- WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine… ☆137 Updated last month