hzxie / DynamicVLA
The official implementation of "DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation" (arXiv 2601.22153).
☆69 · Updated this week
Alternatives and similar repositories for DynamicVLA
Users interested in DynamicVLA are comparing it to the libraries listed below.
- [NeurIPS 2025] Source code for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning" ☆125 · Updated 2 months ago
- Code implementation of the paper "World-in-World: World Models in a Closed-Loop World" ☆124 · Updated last month
- Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models ☆167 · Updated 3 months ago
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding ☆47 · Updated 4 months ago
- VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning ☆121 · Updated 3 months ago
- ☆178 · Updated last week
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer ☆28 · Updated 2 months ago
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight ☆75 · Updated 2 weeks ago
- The official repo for the paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025) ☆108 · Updated 2 months ago
- Official repository for "Vid2World: Crafting Video Diffusion Models to Interactive World Models" (ICLR 2026), https://arxiv.org/abs/2505.… ☆34 · Updated this week
- Codebase for the paper "Geometry-aware 4D Video Generation for Robot Manipulation" ☆71 · Updated 3 weeks ago
- Official code for the paper "N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models" ☆81 · Updated 2 weeks ago
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces ☆87 · Updated 7 months ago
- Official repo of "From Masks to Worlds: A Hitchhiker’s Guide to World Models" ☆71 · Updated 3 months ago
- [ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control ☆94 · Updated 6 months ago
- StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations. ☆48 · Updated 2 weeks ago
- Unifying 2D and 3D Vision-Language Understanding ☆121 · Updated 6 months ago
- SPAgent, a spatial intelligence agent designed to operate in the physical and spatial world ☆85 · Updated this week
- [ICLR 2026] OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models ☆79 · Updated last week
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision ☆33 · Updated last month
- [NeurIPS 2025] FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens ☆19 · Updated 3 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆172 · Updated 7 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts ☆219 · Updated 3 months ago
- Official implementation of BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation ☆102 · Updated 6 months ago
- Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation" ☆121 · Updated 5 months ago
- Official implementation of the paper "WMPO: World Model-based Policy Optimization for Vision-Language-Action Models" ☆137 · Updated 3 weeks ago
- [NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding ☆70 · Updated 4 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning ☆43 · Updated last year
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-Language-Action Model ☆170 · Updated 3 weeks ago
- EO: Open-source Unified Embodied Foundation Model Series ☆44 · Updated 2 weeks ago