UMass-Embodied-AGI/TesserAct

ICCV 2025 | TesserAct: Learning 4D Embodied World Models

☆404

Alternatives and similar repositories for TesserAct

Users interested in TesserAct are comparing it to the repositories listed below.

InternRobotics / Aether
[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
☆604 · Oct 26, 2025 · Updated 9 months ago
lzylucy / 4dgen
[ICLR 2026] Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation"
☆123 · Jan 10, 2026 · Updated 6 months ago
KlingAIResearch / RoboMaster
[ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
☆107 · Feb 8, 2026 · Updated 5 months ago
ShuangLI59 / unified_video_action
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆400 · Jul 23, 2025 · Updated last year
jzr99 / Geo4D
[ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
☆437 · Jun 6, 2025 · Updated last year
yangzhou24 / OmniWorld
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
☆486 · Apr 16, 2026 · Updated 3 months ago
ByteDance-Seed / TraceAnything
[ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields
☆543 · Oct 31, 2025 · Updated 9 months ago
SunYangtian / UniGeo
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
☆136 · Jun 10, 2025 · Updated last year
wzzheng / StreamVGGT
[ICLR 2026] Streaming 4D Visual Geometry Transformer
☆948 · Oct 27, 2025 · Updated 9 months ago
buoyancy99 / large-video-planner
☆256 · Jan 31, 2026 · Updated 6 months ago
dreamzero0 / dreamzero
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,509 · Apr 19, 2026 · Updated 3 months ago
roboterax / video-prediction-policy
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io
☆407 · May 17, 2025 · Updated last year
henry123-boy / SpaTrackerV2
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
☆987 · Feb 27, 2026 · Updated 5 months ago
SOTAMak1r / DeepVerse
DeepVerse: 4D Autoregressive Video Generation as a World Model
☆230 · Aug 11, 2025 · Updated 11 months ago
CUT3R / CUT3R
Official implementation of Continuous 3D Perception Model with Persistent State
☆1,471 · Aug 27, 2025 · Updated 11 months ago
Robert-gyj / Ctrl-World
ICLR 2026 Paper: Ctrl-World
☆538 · Apr 8, 2026 · Updated 3 months ago
Robbyant / lingbot-va
[RSS 2026] Causal video-action world model for generalist robot control
☆1,700 · Jul 9, 2026 · Updated 3 weeks ago
yyfz / Pi3
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
☆2,096 · Jul 3, 2026 · Updated 3 weeks ago
THU-SI / LangScene-X
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
☆302 · Jul 15, 2025 · Updated last year
AgibotTech / Genie-Envisioner-V1
☆565 · Jun 24, 2026 · Updated last month
HaoyiZhu / SPA
[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
☆177 · Jun 19, 2025 · Updated last year
UMass-Embodied-AGI / 3D-VLA
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
☆630 · Oct 29, 2024 · Updated last year
Davidyao99 / uni4d
[CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
☆225 · May 25, 2025 · Updated last year
UMass-Embodied-AGI / ActionImages
☆71 · Jun 9, 2026 · Updated last month
liruilong940607 / prope
Cameras as Relative Positional Encoding
☆744 · Dec 18, 2025 · Updated 7 months ago
RoboVerseOrg / RoboVerse
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
☆1,792 · Updated this week
xizaoqu / WorldMem
[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory
☆381 · Feb 21, 2026 · Updated 5 months ago
ant-research / FLARE
☆721 · May 1, 2025 · Updated last year
Stereo4d / stereo4d-code
Stereo4D dataset and processing code
☆311 · Nov 4, 2025 · Updated 8 months ago
LatentActionPretraining / LAPA
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆562 · Jan 22, 2025 · Updated last year
UMass-Embodied-AGI / MindJourney
[NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"
☆151 · Nov 4, 2025 · Updated 8 months ago
Junyi42 / monst3r
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
☆1,383 · Jun 16, 2025 · Updated last year
THU-SI / Spatial-MLLM
[NeurIPS 2025 Spotlight] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
☆481 · Feb 5, 2026 · Updated 5 months ago
YkiWu / Point3R
[NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
☆193 · Mar 10, 2026 · Updated 4 months ago
nvidia-cosmos / cosmos-predict2.5
Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the …
☆1,338 · Jun 8, 2026 · Updated last month
yukangcao / Awesome-4D-Spatial-Intelligence
A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)
☆515 · Jun 5, 2026 · Updated last month
chenguolin / MoVieS
[CVPR 2026] Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second".
☆463 · Mar 19, 2026 · Updated 4 months ago
UMass-Embodied-AGI / Articulate-Anymesh
☆137 · May 13, 2025 · Updated last year
NJU-3DV / SpatialVID
[CVPR 2026] SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
☆589 · Apr 22, 2026 · Updated 3 months ago