The official repo for the paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)
☆117 · Nov 15, 2025 · Updated 5 months ago
Alternatives and similar repositories for VQ-VLA
Users who are interested in VQ-VLA are comparing it to the repositories listed below.
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning ☆91 · Oct 14, 2024 · Updated last year
- 🎯 Point cloud 6DoF pose estimation via Central Voting PPF (C++ reproduction of TIP 2021 paper). ☆13 · Nov 26, 2023 · Updated 2 years ago
- Open-source implementations on real robots ☆35 · Nov 25, 2024 · Updated last year
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆175 · Jun 19, 2025 · Updated 10 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning ☆56 · Apr 1, 2025 · Updated last year
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025. ☆106 · Oct 6, 2025 · Updated 6 months ago
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction ☆44 · Aug 9, 2025 · Updated 8 months ago
- [CoRL 2025] Pretraining code for FLOWER VLA on OXE ☆38 · Sep 22, 2025 · Updated 6 months ago
- LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion ☆88 · Mar 18, 2026 · Updated last month
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling ☆460 · Updated this week
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me… ☆24 · Aug 19, 2023 · Updated 2 years ago
- ✨✨ [NeurIPS 2025] Official implementation of BridgeVLA ☆183 · Apr 5, 2026 · Updated 2 weeks ago
- [ICCV 2025] Dense Policy: Bidirectional Autoregressive Learning of Actions DSP ☆78 · Jan 14, 2026 · Updated 3 months ago
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data ☆64 · Mar 6, 2026 · Updated last month
- [ICLR 2026] Unified Vision-Language-Action Model ☆295 · Oct 15, 2025 · Updated 6 months ago
- [NeurIPS'24] NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction ☆126 · Sep 26, 2024 · Updated last year
- ☆33 · May 16, 2025 · Updated 11 months ago
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout ☆30 · Nov 21, 2025 · Updated 4 months ago
- Code of WinT3R: Window-Based Streaming Reconstruction With Camera Token Pool ☆223 · Mar 4, 2026 · Updated last month
- Sim2real robot manipulation utilizing GS modeling ☆14 · Feb 19, 2025 · Updated last year
- ☆43 · Mar 26, 2025 · Updated last year
- DeepVerse: 4D Autoregressive Video Generation as a World Model ☆223 · Aug 11, 2025 · Updated 8 months ago
- LAP: Language-Action Pre-Training Enables Zero-Shot Cross Embodiment Transfer ☆111 · Updated this week
- ☆150 · Oct 15, 2024 · Updated last year
- Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning ☆144 · Aug 1, 2025 · Updated 8 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving. ☆41 · Apr 8, 2026 · Updated last week
- Official implementation of "From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction" ☆64 · Nov 25, 2025 · Updated 4 months ago
- [NIPS 2025] FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens ☆22 · Oct 12, 2025 · Updated 6 months ago
- Infinite-Forcing: Towards Infinite-Long Video Generation ☆148 · Nov 13, 2025 · Updated 5 months ago
- Author's implementation of DemoDiffusion. ☆64 · Jan 14, 2026 · Updated 3 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ☆294 · Jul 8, 2025 · Updated 9 months ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." ☆128 · Oct 23, 2025 · Updated 5 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling ☆586 · Oct 26, 2025 · Updated 5 months ago
- ☆19 · Jul 7, 2024 · Updated last year
- ☆96 · Sep 4, 2024 · Updated last year
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision ☆40 · Dec 2, 2025 · Updated 4 months ago
- ☆55 · Sep 21, 2025 · Updated 6 months ago
- Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025) ☆15 · May 2, 2025 · Updated 11 months ago
- Evaluation for 3D reconstruction, includes monocular depth, video depth, relative camera pose & multi-view point map estimation. ☆20 · Aug 26, 2025 · Updated 7 months ago