flow-diffusion/AVDC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/flow-diffusion/AVDC)

flow-diffusion / AVDC

Official repository of Learning to Act from Actionless Videos through Dense Correspondences.

☆262

Alternatives and similar repositories for AVDC

Users that are interested in AVDC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

flow-diffusion / AVDC_experiments
View on GitHub
The official codebase for running the experiments described in the AVDC paper.
☆20Oct 2, 2024Updated last year
kvablack / susie
View on GitHub
Code for subgoal synthesis via image editing
☆158Oct 23, 2023Updated 2 years ago
video-language-planning / vlp_code
View on GitHub
☆81May 23, 2025Updated last year
anuragajay / hip
View on GitHub
Codebase for HiP
☆90Dec 15, 2023Updated 2 years ago
bytedance / GR-1
View on GitHub
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆310Apr 22, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
rainbow979 / robodreamer
View on GitHub
☆101Sep 4, 2024Updated last year
rail-berkeley / bridge_data_v2
View on GitHub
☆285Mar 17, 2024Updated 2 years ago
Large-Trajectory-Model / ATM
View on GitHub
Official codebase for "Any-point Trajectory Modeling for Policy Learning"
☆279Jun 19, 2025Updated last year
ShuangLI59 / unified_video_action
View on GitHub
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆400Jul 23, 2025Updated 11 months ago
zhouxian / act3d-chained-diffuser
View on GitHub
A unified architecture for multimodal multi-task robotic policy learning.
☆184Feb 2, 2024Updated 2 years ago
simpler-env / SimplerEnv
View on GitHub
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆1,126Dec 20, 2025Updated 7 months ago
HeegerGao / FLIP
View on GitHub
Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks
☆85Dec 12, 2024Updated last year
cvlab-columbia / dreamitate
View on GitHub
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)
☆59Jun 7, 2025Updated last year
nickgkan / 3d_diffuser_actor
View on GitHub
Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"
☆391Aug 17, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
octo-models / octo
View on GitHub
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
☆1,711Jul 31, 2024Updated last year
facebookresearch / r3m
View on GitHub
Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
☆379Mar 21, 2023Updated 3 years ago
UMass-Embodied-AGI / 3D-VLA
View on GitHub
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
☆627Oct 29, 2024Updated last year
OpenDriveLab / MPI
View on GitHub
[RSS 2024] Learning Manipulation by Predicting Interaction
☆120Jul 2, 2025Updated last year
google-research / language-table
View on GitHub
Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning.
☆362Jul 2, 2026Updated 2 weeks ago
j96w / MimicPlay
View on GitHub
"MimicPlay: Long-Horizon Imitation Learning by Watching Human Play" code repository
☆312Apr 23, 2024Updated 2 years ago
ayushjain1144 / ebmplanner
View on GitHub
Code for the RSS 2023 paper "Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement"
☆21Jul 4, 2023Updated 3 years ago
penn-pal-lab / LIV
View on GitHub
Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)
☆135Oct 19, 2023Updated 2 years ago
homangab / Track-2-Act
View on GitHub
code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation
☆105Jul 31, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Streaming-Diffusion-Policy / streaming_diffusion_policy
View on GitHub
Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models
☆79May 14, 2025Updated last year
jayLEE0301 / vq_bet_official
View on GitHub
Official code for "Behavior Generation with Latent Actions" (ICML 2024 Spotlight)
☆212Feb 28, 2024Updated 2 years ago
mees / calvin
View on GitHub
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
☆960Sep 8, 2025Updated 10 months ago
intuitive-robots / mdt_policy
View on GitHub
[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…
☆170Oct 16, 2024Updated last year
MohitShridhar / genima
View on GitHub
Official Code Repo for GENIMA
☆77Oct 29, 2025Updated 8 months ago
liruiw / HPT
View on GitHub
Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.
☆542Dec 6, 2024Updated last year
LatentActionPretraining / LAPA
View on GitHub
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆560Jan 22, 2025Updated last year
droid-dataset / droid_policy_learning
View on GitHub
DROID Policy Learning and Evaluation
☆290Apr 22, 2025Updated last year
WangYixuan12 / d3fields
View on GitHub
[CoRL 24 Oral] D^3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement
☆185Nov 2, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
bytedance / IRASim
View on GitHub
☆159Jul 8, 2025Updated last year
1x-technologies / 1xgpt
View on GitHub
world modeling challenge for humanoid robots
☆564Nov 8, 2024Updated last year
thuml / iVideoGPT
View on GitHub
Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223
☆186Sep 23, 2025Updated 9 months ago
ademiadeniji / lamp
View on GitHub
☆47Jan 29, 2024Updated 2 years ago
facebookresearch / vip
View on GitHub
Official repository for "VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training"
☆187Oct 19, 2023Updated 2 years ago
moka-manipulation / moka
View on GitHub
MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)
☆101Jul 16, 2024Updated 2 years ago
bytedance / GR-MG
View on GitHub
Official implementation of GR-MG
☆90Jan 12, 2025Updated last year