video-to-action / video-to-action-release
View external linksLinks

[ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration

☆60

Alternatives and similar repositories for video-to-action-release

Users that are interested in video-to-action-release are comparing it to the libraries listed below

Sorting:

video-to-action / v2a-video-model-release
View on GitHub
☆14May 4, 2025Updated 9 months ago
devinluo27 / comp_diffuser_release
View on GitHub
[NeurIPS 2025 Spotlight] Generative Trajectory Stitching through Diffusion Composition
☆68Sep 6, 2025Updated 5 months ago
yanweiw / itps
View on GitHub
☆45Apr 2, 2025Updated 10 months ago
devinluo27 / potential-motion-plan-release
View on GitHub
[ICML 2024] Potential Based Diffusion Motion Planning
☆140Sep 6, 2025Updated 5 months ago
jmwang0117 / Video4Robot
View on GitHub
List of papers on video-centric robot learning
☆22Nov 16, 2024Updated last year
Video-as-Agent / VideoAgent
View on GitHub
Official implementation of "Self-Improving Video Generation"
☆77Apr 25, 2025Updated 9 months ago
Streaming-Diffusion-Policy / streaming_diffusion_policy
View on GitHub
Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models
☆75May 14, 2025Updated 9 months ago
jsu27 / decomp_diffusion
View on GitHub
[ICML 2024] Compositional Image Decomposition with Diffusion Models
☆53Jul 7, 2024Updated last year
video-language-planning / vlp_code
View on GitHub
☆78May 23, 2025Updated 8 months ago
Owen718 / LongPrompt-LLamaGen
View on GitHub
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…
☆30Oct 21, 2024Updated last year
gorkaydemir / track_on
View on GitHub
[ICLR 2025] Track-On: Transformer-based Online Point Tracking with Memory, and [arXiv 2025] Track-On2: Enhancing Online Point Tracking wi…
☆93Dec 20, 2025Updated last month
cfeng16 / this-and-that
View on GitHub
☆18Jul 9, 2024Updated last year
LatentActionPretraining / LAPA
View on GitHub
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆460Jan 22, 2025Updated last year
ShuangLI59 / unified_video_action
View on GitHub
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆332Jul 23, 2025Updated 6 months ago
buoyancy99 / diffusion-forcing
View on GitHub
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
☆1,159Nov 9, 2025Updated 3 months ago
flow-diffusion / AVDC
View on GitHub
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.
☆247Apr 25, 2024Updated last year
homangab / Track-2-Act
View on GitHub
code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation
☆100Jul 31, 2024Updated last year
rozumden / tracking-by-3d
View on GitHub
[ICCV 2023] Tracking by 3D Model Estimation of Unknown Objects in Videos
☆22Sep 26, 2023Updated 2 years ago
clear-nus / ltldog
View on GitHub
☆13Dec 17, 2025Updated 2 months ago
yilundu / ired_code_release
View on GitHub
☆88Jun 14, 2024Updated last year
buoyancy99 / research-template
View on GitHub
An ML research template with good documentation by Boyuan Chen, an MIT PhD student
☆128Mar 4, 2025Updated 11 months ago
HeegerGao / FLIP
View on GitHub
Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks
☆79Dec 12, 2024Updated last year
zju3dv / EgoAgent
View on GitHub
Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".
☆44Jun 30, 2025Updated 7 months ago
Hoyyyaard / 3DFlowAction
View on GitHub
☆47Jul 6, 2025Updated 7 months ago
MohitShridhar / genima
View on GitHub
Official Code Repo for GENIMA
☆77Oct 29, 2025Updated 3 months ago
lzylucy / 4dgen
View on GitHub
[ICLR 2026] Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation"
☆84Jan 10, 2026Updated last month
horipse01 / 3d-foundation-policy
View on GitHub
☆88Sep 23, 2025Updated 4 months ago
Nut-World / NutWorld
View on GitHub
Seeing World Dynamics in a Nutshell
☆112Mar 18, 2025Updated 10 months ago
Large-Trajectory-Model / ATM
View on GitHub
Official codebase for "Any-point Trajectory Modeling for Policy Learning"
☆271Jun 19, 2025Updated 7 months ago
ManiCM-fast / ManiCM
View on GitHub
ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation
☆122May 8, 2025Updated 9 months ago
rainbow979 / robodreamer
View on GitHub
☆92Sep 4, 2024Updated last year
liruiw / Fleet-Tools
View on GitHub
Tool-use Robotic Benchmark built with Drake Simulation
☆29Jul 9, 2024Updated last year
seanywang0408 / RadianceMapping
View on GitHub
Official code of AAAI'23 paper: Boosting Point Clouds Rendering via Radiance Mapping written in PyTorch
☆28Aug 2, 2023Updated 2 years ago
Zerg-Overmind / Can3Tok
View on GitHub
Official code for the paper: Can3Tok (ICCV2025)
☆39Aug 23, 2025Updated 5 months ago
ZibinDong / AlignDiff-ICLR2024
View on GitHub
☆32Mar 10, 2024Updated last year
Stanford-TML / SpringGrasp_release
View on GitHub
Official implemnetation of SpringGrasp: Synthesizing Compliant, Dexterous Grasp under Shape Uncertainty
☆34Oct 15, 2025Updated 4 months ago
PingchuanMa / PingchuanMa.github.io
View on GitHub
Source code for my homepage.
☆14Nov 25, 2025Updated 2 months ago
InternRobotics / Seer
View on GitHub
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
☆279Jul 8, 2025Updated 7 months ago
real-stanford / im2Flow2Act
View on GitHub
[CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface
☆150Oct 17, 2024Updated last year

video-to-action / video-to-action-releaseView external linksLinks

Alternatives and similar repositories for video-to-action-release

video-to-action / video-to-action-release
View external linksLinks