A Large-scale Video Action Dataset
☆438Jan 16, 2026Updated 2 months ago
Alternatives and similar repositories for Action100M
Users that are interested in Action100M are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026] MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE☆139Mar 13, 2026Updated last week
- Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward☆60Nov 27, 2025Updated 4 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆234Oct 17, 2025Updated 5 months ago
- Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"☆419Nov 24, 2025Updated 4 months ago
- [ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields☆517Oct 31, 2025Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos☆473Feb 11, 2026Updated last month
- ☆12Jul 22, 2025Updated 8 months ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆444Feb 25, 2026Updated last month
- ☆25Jun 12, 2025Updated 9 months ago
- Official implementation of "Repurposing Geometric Foundation Models for Multi-view Diffusion"☆113Updated this week
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos☆167Oct 1, 2025Updated 5 months ago
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)"☆31Jan 6, 2026Updated 2 months ago
- [CVPR 2026] SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆521Mar 1, 2026Updated 3 weeks ago
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.☆24Oct 19, 2025Updated 5 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- A list of works on video generation towards world model☆433Mar 18, 2026Updated last week
- Tools for the Embody 3D Dataset☆231Oct 30, 2025Updated 4 months ago
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement☆17Jan 5, 2026Updated 2 months ago
- [CVPR2026] Scaling Spatial Intelligence with Multimodal Foundation Models☆184Mar 19, 2026Updated last week
- An unified model for 4D human-scene reconstruction☆456Dec 30, 2025Updated 2 months ago
- Krea Realtime 14B. An open-source realtime AI video model.☆513Nov 13, 2025Updated 4 months ago
- [ICLR 2026] RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation☆107Feb 14, 2026Updated last month
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆2,212Mar 12, 2026Updated 2 weeks ago
- Official repository of paper "Mojito: LLM-Aided Motion Instructor with Jitter-Reduced Inertial Tokens".☆21May 12, 2025Updated 10 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control☆813Jun 9, 2025Updated 9 months ago
- [NeurIPS 2024] DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering☆13Oct 22, 2024Updated last year
- ☆17Jul 24, 2025Updated 8 months ago
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆85Mar 9, 2026Updated 2 weeks ago
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data☆51Mar 6, 2026Updated 3 weeks ago
- CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning☆38Mar 21, 2025Updated last year
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆153Jul 24, 2025Updated 8 months ago
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆221Aug 11, 2025Updated 7 months ago
- Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Vi…☆241Mar 19, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Mar 5, 2025Updated last year
- [NeurIPS 2025] Frame In-N-Out: Unbounded Controllable Image-to-Video Generation☆30Jan 5, 2026Updated 2 months ago
- Official implementation of AMPLIFY: Actionless Motion Priors for Robot Learning from Videos☆45Feb 26, 2026Updated last month
- ☆80Nov 4, 2025Updated 4 months ago
- Use Blender for figures.☆15Feb 11, 2026Updated last month
- ☆27Mar 3, 2025Updated last year
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆374Oct 21, 2025Updated 5 months ago