The official implementation of "DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation". (arXiv 2601.22153)
☆299May 3, 2026Updated last month
Alternatives and similar repositories for DynamicVLA
Users that are interested in DynamicVLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Benchmarking memory-augmented robotic generalist policies☆118Jun 18, 2026Updated last week
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement☆17Jan 5, 2026Updated 5 months ago
- VLS: Steering Pretrained Robot Policies via Vision–Language Models☆63Mar 29, 2026Updated 3 months ago
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation☆49Feb 23, 2026Updated 4 months ago
- Reinforcing Action Policies by Prophesying☆41Nov 26, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout☆37May 25, 2026Updated last month
- The official repository for the paper "Real-world Reinforcement Learning from Suboptimal Interventions”.☆55Apr 23, 2026Updated 2 months ago
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆63Apr 7, 2026Updated 2 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆118Apr 14, 2025Updated last year
- Implementation of PegasusFlow: Parallel Rolling-Denoising Score Sampling for Robot Diffusion Planner Flow Matching☆24May 25, 2026Updated last month
- ICCV2025☆171Dec 10, 2025Updated 6 months ago
- [CVPR 2026] HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model☆70Mar 11, 2026Updated 3 months ago
- Official repo for "TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders"☆25Apr 9, 2026Updated 2 months ago
- Official implementation of HEAD CoRL 2025☆26Aug 22, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations.☆65May 21, 2026Updated last month
- code for Imagination-Policy☆15Dec 1, 2024Updated last year
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆36Mar 10, 2026Updated 3 months ago
- Code Repository for ControlVLA, CoRL2025.☆96Oct 26, 2025Updated 8 months ago
- 复旦研究生入学教育测试☆23Aug 28, 2025Updated 10 months ago
- VLA-0: Building State-of-the-Art VLAs with Zero Modification☆486Feb 21, 2026Updated 4 months ago
- [IROS 2025] ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model☆44Jun 26, 2025Updated last year
- Official Implementation of MoE-Loco: Mixture of Experts for Multitask Locomotion☆51Oct 22, 2025Updated 8 months ago
- ☆147Aug 27, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding☆55Feb 5, 2026Updated 4 months ago
- Official implementation of T-PAMI25 paper "M²Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes"☆121Jun 17, 2025Updated last year
- Uncertainty-aware Latent Safety Filters for Avoiding Out-of-Distribution Failures (CoRL 2025)☆29Jun 23, 2026Updated last week
- [CVPR'2026]: MoRe: Motion-aware Feed-forward 4D Reconstruction Transformer☆70Apr 21, 2026Updated 2 months ago
- [ICLR 2026] RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation☆151Feb 14, 2026Updated 4 months ago
- ☆45Mar 6, 2026Updated 3 months ago
- Official Code of CVPR 2025 paper "SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters"☆56Jul 13, 2025Updated 11 months ago
- Using apriltag & apriltag_ros & bluefox2 & OpenCV & VISP.☆17May 20, 2020Updated 6 years ago
- TrackGPT: Track What You Need in Videos via Text Prompts☆25May 16, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆83Feb 27, 2026Updated 4 months ago
- ☆35Feb 12, 2026Updated 4 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆48Nov 21, 2025Updated 7 months ago
- ☆33Nov 20, 2025Updated 7 months ago
- ☆241Jun 1, 2026Updated last month
- Implementation of Mimic-Video, Video-Action Models for SOTA Generalizable Robot Control Beyond VLAs☆113May 31, 2026Updated last month
- ☆28Aug 6, 2024Updated last year