The official implementation of "DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation". (arXiv 2601.22153)
☆277 · May 3, 2026 · Updated last month
Alternatives and similar repositories for DynamicVLA
Users that are interested in DynamicVLA are comparing it to the libraries listed below.
- ☆115 · Mar 31, 2026 · Updated 2 months ago
- Benchmarking memory-augmented robotic generalist policies ☆110 · Updated this week
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement ☆17 · Jan 5, 2026 · Updated 5 months ago
- VLS: Steering Pretrained Robot Policies via Vision–Language Models ☆62 · Mar 29, 2026 · Updated 2 months ago
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation ☆46 · Feb 23, 2026 · Updated 3 months ago
- Reinforcing Action Policies by Prophesying ☆41 · Nov 26, 2025 · Updated 6 months ago
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout ☆36 · May 25, 2026 · Updated 2 weeks ago
- The official repository for the paper "Real-world Reinforcement Learning from Suboptimal Interventions". ☆53 · Apr 23, 2026 · Updated last month
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning ☆61 · Apr 7, 2026 · Updated 2 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction ☆116 · Apr 14, 2025 · Updated last year
- Implementation of PegasusFlow: Parallel Rolling-Denoising Score Sampling for Robot Diffusion Planner Flow Matching ☆22 · May 25, 2026 · Updated 2 weeks ago
- ICCV2025 ☆168 · Dec 10, 2025 · Updated 6 months ago
- [CVPR 2026] HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model ☆65 · Mar 11, 2026 · Updated 3 months ago
- Official repo for "TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders" ☆25 · Apr 9, 2026 · Updated 2 months ago
- Official implementation of HEAD CoRL 2025 ☆26 · Aug 22, 2025 · Updated 9 months ago
- StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations. ☆64 · May 21, 2026 · Updated 3 weeks ago
- code for Imagination-Policy ☆15 · Dec 1, 2024 · Updated last year
- Code Repository for ControlVLA, CoRL2025. ☆94 · Oct 26, 2025 · Updated 7 months ago
- Fudan University graduate student entrance education test ☆23 · Aug 28, 2025 · Updated 9 months ago
- VLA-0: Building State-of-the-Art VLAs with Zero Modification ☆483 · Feb 21, 2026 · Updated 3 months ago
- [IROS 2025] ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model ☆44 · Jun 26, 2025 · Updated 11 months ago
- ☆143 · Aug 27, 2025 · Updated 9 months ago
- Official Implementation of MoE-Loco: Mixture of Experts for Multitask Locomotion ☆51 · Oct 22, 2025 · Updated 7 months ago
- [CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding ☆54 · Feb 5, 2026 · Updated 4 months ago
- Official implementation of T-PAMI25 paper "M²Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes" ☆119 · Jun 17, 2025 · Updated 11 months ago
- Uncertainty-aware Latent Safety Filters for Avoiding Out-of-Distribution Failures (CoRL 2025) ☆29 · Sep 23, 2025 · Updated 8 months ago
- [CVPR'2026]: MoRe: Motion-aware Feed-forward 4D Reconstruction Transformer ☆66 · Apr 21, 2026 · Updated last month
- [ICLR 2026] RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation ☆144 · Feb 14, 2026 · Updated 3 months ago
- ☆41 · Mar 6, 2026 · Updated 3 months ago
- Official Code of CVPR 2025 paper "SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters" ☆56 · Jul 13, 2025 · Updated 10 months ago
- Using apriltag & apriltag_ros & bluefox2 & OpenCV & VISP. ☆17 · May 20, 2020 · Updated 6 years ago
- ☆58 · Apr 8, 2026 · Updated 2 months ago
- Implementation of Mimic-Video, Video-Action Models for SOTA Generalizable Robot Control Beyond VLAs ☆110 · May 31, 2026 · Updated last week
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models" ☆14 · Oct 18, 2024 · Updated last year
- TrackGPT: Track What You Need in Videos via Text Prompts ☆25 · May 16, 2023 · Updated 3 years ago
- ☆80 · Feb 27, 2026 · Updated 3 months ago
- ☆35 · Feb 12, 2026 · Updated 4 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping ☆47 · Nov 21, 2025 · Updated 6 months ago
- ☆33 · Nov 20, 2025 · Updated 6 months ago