H-Freax / Awesome-Video-Robotic-Papers

This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and follow me if you like what you see🤩.

☆168

Alternatives and similar repositories for Awesome-Video-Robotic-Papers

Users who are interested in Awesome-Video-Robotic-Papers compare it to the repositories listed below.

aiming-lab / GRAPE
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
☆151 Updated 7 months ago
allenai / molmoact
Official Repository for MolmoAct
☆254 Updated 3 weeks ago
kvablack / susie
Code for subgoal synthesis via image editing
☆144 Updated 2 years ago
OpenMOSS / VLABench
Official repo of VLABench, a large-scale benchmark designed for fairly evaluating VLAs, embodied agents, and VLMs.
☆332 Updated last week
Stanford-ILIAD / openvla-mini
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆293 Updated 8 months ago
LostXine / LLaRA
[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
☆225 Updated 7 months ago
MichalZawalski / embodied-CoT
Embodied Chain of Thought: A robotic policy that reasons to solve the task.
☆323 Updated 7 months ago
rail-berkeley / bridge_data_v2
☆238 Updated last year
LatentActionPretraining / LAPA
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆400 Updated 9 months ago
Fanqi-Lin / Data-Scaling-Laws
Official implementation of "Data Scaling Laws in Imitation Learning for Robotic Manipulation"
☆195 Updated last year
flow-diffusion / AVDC
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.
☆233 Updated last year
BeingBeyond / Being-H0
Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos
☆179 Updated 2 months ago
ShuangLI59 / unified_video_action
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆291 Updated 3 months ago
Max-Fu / icrt
[ICRA 2025] In-Context Imitation Learning via Next-Token Prediction
☆100 Updated 8 months ago
Max-Fu / otter
[ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
☆110 Updated 7 months ago
RayYoh / OCRM_survey
A Survey of Embodied Learning for Object-Centric Robotic Manipulation
☆244 Updated last year
wentaoyuan / RoboPoint
A Vision-Language Model for Spatial Affordance Prediction in Robotics
☆202 Updated 4 months ago
H-Freax / ThinkGrasp
[CoRL2024] ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter. https://arxiv.org/abs/2407.11298
☆102 Updated 3 months ago
arnold-benchmark / arnold
[ICCV 2023] Official code repository for ARNOLD benchmark
☆176 Updated 8 months ago
WEIRDLabUW / unified-world-model
Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets
☆150 Updated last month
Large-Trajectory-Model / ATM
Official codebase for "Any-point Trajectory Modeling for Policy Learning"
☆262 Updated 5 months ago
lmzpai / roboMamba
The repo for the paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`
☆139 Updated 10 months ago
EDiRobotics / GR1-Training
Reimplementation of GR-1, a generalized policy for robotics manipulation.
☆144 Updated last year
2toinf / UniAct
[CVPR 2025] The official implementation of "Universal Actions for Enhanced Embodied Foundation Models"
☆212 Updated 2 weeks ago
Fanqi-Lin / OneTwoVLA
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆198 Updated 5 months ago
yueyang130 / DeeR-VLA
Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"
☆117 Updated 9 months ago
DelinQu / awesome-vision-language-action-model
Latest Advances on Vision-Language-Action Models.
☆119 Updated 8 months ago
embodiedreasoning / ERQA
Embodied Reasoning Question Answer (ERQA) Benchmark
☆241 Updated 8 months ago
InternRobotics / VLAC
VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
☆222 Updated last month
OpenHelix-Team / OpenHelix
OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation
☆323 Updated 2 months ago