UMass-Embodied-AGI / TesserActLinks
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
β367Updated 4 months ago
Alternatives and similar repositories for TesserAct
Users that are interested in TesserAct are comparing it to the libraries listed below
Sorting:
- [CORL 2025 Oral]One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation.β431Updated 4 months ago
- π 3D and 4D World Modeling: A Surveyβ742Updated last week
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulationβ256Updated 9 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.β204Updated 2 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representationβ171Updated 6 months ago
- Ctrl-World: A Controllable Generative World Model for Robot Manipualtionβ220Updated 3 weeks ago
- VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videosβ217Updated this week
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modelingβ554Updated 2 months ago
- β179Updated 5 months ago
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modelingβ406Updated this week
- Towards a Generative 3D World Engine for Embodied Intelligenceβ365Updated 3 weeks ago
- β152Updated last year
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Modelβ145Updated 2 weeks ago
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoningβ226Updated 3 weeks ago
- Trace Anything: Representing Any Video in 4D via Trajectory Fieldsβ435Updated last month
- This is the repository that contains source code for the PhysGen3D.β236Updated 3 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering andβ¦β60Updated 8 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligenceβ411Updated 3 weeks ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulationβ156Updated 6 months ago
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D Worldβ358Updated 2 months ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulationβ172Updated 6 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstructionβ310Updated 3 months ago
- β79Updated 4 months ago
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"β39Updated 3 months ago
- WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagineβ¦β122Updated 3 weeks ago
- β137Updated 8 months ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".β183Updated 6 months ago
- Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasetsβ169Updated 2 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)β309Updated 5 months ago
- Official implementation of NeurIPS 2025 paper "SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent"β107Updated last month