The official implementation of "DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation" (arXiv 2601.22153).
☆151 Jan 30, 2026 Updated last month
Alternatives and similar repositories for DynamicVLA
Users who are interested in DynamicVLA are comparing it to the repositories listed below
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation ☆31 Updated this week
- A Curated List of Vision-Language-Action (VLA) Research ☆61 Updated this week
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout ☆28 Nov 21, 2025 Updated 3 months ago
- ☆29 Feb 12, 2026 Updated 2 weeks ago
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement ☆17 Jan 5, 2026 Updated last month
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation ☆30 Feb 5, 2026 Updated 3 weeks ago
- [ICCV 2025] Official repo of "EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow" ☆27 Oct 16, 2025 Updated 4 months ago
- FieldGen is a semi-automatic data generation framework that enables scalable collection of diverse, high-quality real-world manipulation … ☆25 Oct 28, 2025 Updated 4 months ago
- Minute-long video generation at 24FPS. ☆50 Feb 2, 2026 Updated 3 weeks ago
- [CVPR'2025] "DexHandDiff: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation" ☆20 Jul 3, 2025 Updated 7 months ago
- Reinforcing Action Policies by Prophesying ☆40 Nov 26, 2025 Updated 3 months ago
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models" ☆14 Oct 18, 2024 Updated last year
- StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations. ☆52 Jan 12, 2026 Updated last month
- ☆33 Nov 26, 2025 Updated 3 months ago
- Galaxea's first diffusion policy release ☆38 Aug 18, 2025 Updated 6 months ago
- CoV: Chain-of-View Prompting for Spatial Reasoning ☆51 Jan 23, 2026 Updated last month
- [ICRA 2026] 🌠 DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation ☆29 Jan 14, 2026 Updated last month
- ICCV2025 ☆158 Dec 10, 2025 Updated 2 months ago
- Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models ☆87 Jan 14, 2026 Updated last month
- NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards ☆93 Jan 11, 2026 Updated last month
- ☆43 Updated this week
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies" ☆26 Sep 25, 2025 Updated 5 months ago
- GraspFast: Multi-stage Lightweight 6-DoF Grasp Pose Detection with RGB-D Image ☆23 Jun 20, 2025 Updated 8 months ago
- Official implementation of T-PAMI25 paper "M²Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes" ☆109 Jun 17, 2025 Updated 8 months ago
- Official PyTorch implementation for ICML 2025 paper: UP-VLA. ☆56 Jan 20, 2026 Updated last month
- Official Implementation of MoE-Loco: Mixture of Experts for Multitask Locomotion ☆34 Oct 22, 2025 Updated 4 months ago
- Extended implementation of RoboDexVLM (IROS 2025) ☆31 Nov 13, 2025 Updated 3 months ago
- ☆22 Feb 15, 2026 Updated 2 weeks ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction ☆115 Apr 14, 2025 Updated 10 months ago
- Code for the paper "3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation" ☆32 Aug 18, 2025 Updated 6 months ago
- Code Repository for ControlVLA, CoRL2025. ☆85 Oct 26, 2025 Updated 4 months ago
- Official implementation of Dexterity from Smart Lenses Multi-Fingered Robot Manipulation with In-the-Wild Human Demonstrations. Project w… ☆45 Dec 26, 2025 Updated 2 months ago
- AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation ☆35 Jul 25, 2025 Updated 7 months ago
- Official Code of CVPR 2025 paper "SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters" ☆52 Jul 13, 2025 Updated 7 months ago
- [CoRL2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation ☆42 Jun 4, 2024 Updated last year
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning ☆56 Apr 1, 2025 Updated 11 months ago
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control ☆167 Dec 11, 2025 Updated 2 months ago
- [CoRL 2025] GC-VLN: Instruction as Graph Constraints for Training-free Vision-and-Language Navigation ☆63 Sep 16, 2025 Updated 5 months ago
- [SIGGRAPH Asia 2025] Official github repo of SeqTex, an end-to-end 3D texture generation method using video diffusion priors. ☆38 Dec 12, 2025 Updated 2 months ago