facebookresearch/Action100M

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/Action100M)

facebookresearch / Action100M

A Large-scale Video Action Dataset

☆482

Alternatives and similar repositories for Action100M

Users that are interested in Action100M are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yangzhou24 / OmniWorld
View on GitHub
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
☆485Apr 16, 2026Updated 3 months ago
NJU-3DV / SpatialVID
View on GitHub
[CVPR 2026] SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
☆589Apr 22, 2026Updated 3 months ago
baaivision / Emu3.5
View on GitHub
Native Multimodal Models are World Learners
☆1,538Dec 30, 2025Updated 6 months ago
ByteDance-Seed / TraceAnything
View on GitHub
[ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields
☆543Oct 31, 2025Updated 8 months ago
NVIDIA / DreamDojo
View on GitHub
Official Codebase for "DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos" (ICML 2026)
☆1,008Mar 21, 2026Updated 4 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
buoyancy99 / large-video-planner
View on GitHub
☆256Jan 31, 2026Updated 5 months ago
Tencent-Hunyuan / HY-WorldPlay
View on GitHub
HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency
☆1,560Jun 10, 2026Updated last month
Robbyant / lingbot-va
View on GitHub
[RSS 2026] Causal video-action world model for generalist robot control
☆1,672Jul 9, 2026Updated 2 weeks ago
thu-ml / Causal-Forcing
View on GitHub
[ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactiv…
☆879Updated this week
Lixsp11 / sekai-codebase
View on GitHub
[NeurIPS 2025] Sekai: A Video Dataset towards World Exploration
☆302Jun 27, 2026Updated 3 weeks ago
Robbyant / lingbot-video
View on GitHub
Scaling Mixture-of-Experts Video Pretraining for Embodied Intelligence
☆868Jul 10, 2026Updated 2 weeks ago
facebookresearch / vjepa2
View on GitHub
PyTorch code and models for VJEPA2 self-supervised learning from video.
☆4,392Mar 23, 2026Updated 4 months ago
InternRobotics / Aether
View on GitHub
[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
☆604Oct 26, 2025Updated 9 months ago
GaTech-RL2 / EgoVerse
View on GitHub
EgoVerse: Egocentric Data for Robot Learning from Around the World
☆483Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
liruilong940607 / prope
View on GitHub
Cameras as Relative Positional Encoding
☆742Dec 18, 2025Updated 7 months ago
henry123-boy / SpaTrackerV2
View on GitHub
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
☆984Feb 27, 2026Updated 4 months ago
Robbyant / lingbot-world
View on GitHub
Advancing Open-source World Models
☆4,281Jul 9, 2026Updated 2 weeks ago
bytetriper / RAE
View on GitHub
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
☆1,978Feb 25, 2026Updated 5 months ago
TencentARC / RollingForcing
View on GitHub
[ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
☆444Oct 31, 2025Updated 8 months ago
showlab / Olaf-World
View on GitHub
[ICML 2026] Orienting Latent Actions for Video World Modeling
☆117Apr 20, 2026Updated 3 months ago
cambrian-mllm / cambrian-s
View on GitHub
Cambrian-S: Towards Spatial Supersensing in Video
☆563Apr 3, 2026Updated 3 months ago
apple / ml-egodex
View on GitHub
EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video
☆347Aug 20, 2025Updated 11 months ago
NVlabs / cosmos-policy
View on GitHub
Cosmos Policy
☆837Jan 23, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
KlingAIResearch / MemFlow
View on GitHub
Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"
☆216Dec 29, 2025Updated 6 months ago
hwjiang1510 / RayZer
View on GitHub
Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"
☆444Nov 24, 2025Updated 8 months ago
JaydenLyh / Reward-Forcing
View on GitHub
[CVPR 2026 Highlight] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
☆352Dec 15, 2025Updated 7 months ago
TencentARC / VerseCrafter
View on GitHub
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
☆413Updated this week
IGL-HKUST / DiffusionAsShader
View on GitHub
[SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
☆823Jun 9, 2025Updated last year
nv-tlabs / vipe
View on GitHub
ViPE: Video Pose Engine for Geometric 3D Perception
☆2,050Jun 9, 2026Updated last month
chengzhag / UCPE
View on GitHub
📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!
☆210May 15, 2026Updated 2 months ago
Any-4D / Any4D
View on GitHub
Any4D: Unified Feed-Forward Metric 4D Reconstruction
☆386Apr 17, 2026Updated 3 months ago
huangwl18 / PointWorld
View on GitHub
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
☆418Mar 11, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xizaoqu / WorldMem
View on GitHub
[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory
☆381Feb 21, 2026Updated 5 months ago
KlingAIResearch / UniVideo
View on GitHub
[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos
☆541Jul 3, 2026Updated 3 weeks ago
xbyym / StableWorld
View on GitHub
StableWorld: Towards Stable and Consistent Long Interactive Video Generation
☆97Mar 18, 2026Updated 4 months ago
kwsong0113 / diffusion-forcing-transformer
View on GitHub
[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"
☆705Jul 1, 2025Updated last year
dreamzero0 / dreamzero
View on GitHub
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,485Apr 19, 2026Updated 3 months ago
facebookresearch / tuna-2
View on GitHub
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆739Updated this week
Seed3D / Seed3D
View on GitHub
☆213Oct 22, 2025Updated 9 months ago