The official repo for the paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)
☆123 · Nov 15, 2025 · Updated 6 months ago
Alternatives and similar repositories for VQ-VLA
Users that are interested in VQ-VLA are comparing it to the libraries listed below.
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning ☆91 · Oct 14, 2024 · Updated last year
- 🎯 Point cloud 6DoF pose estimation via Central Voting PPF (C++ reproduction of TIP 2021 paper). ☆13 · Nov 26, 2023 · Updated 2 years ago
- Open-source implementations on real robots ☆35 · Nov 25, 2024 · Updated last year
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆175 · Jun 19, 2025 · Updated 11 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning ☆57 · Apr 1, 2025 · Updated last year
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025. ☆108 · Oct 6, 2025 · Updated 7 months ago
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction ☆44 · Aug 9, 2025 · Updated 9 months ago
- [CoRL 2025] Pretraining code for FLOWER VLA on OXE ☆39 · Sep 22, 2025 · Updated 8 months ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling ☆473 · Apr 16, 2026 · Updated last month
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me… ☆24 · Aug 19, 2023 · Updated 2 years ago
- ✨✨【NeurIPS 2025】Official implementation of BridgeVLA ☆188 · Apr 5, 2026 · Updated last month
- [ICCV 2025] Dense Policy (DSP): Bidirectional Autoregressive Learning of Actions ☆78 · Jan 14, 2026 · Updated 4 months ago
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data ☆75 · Mar 6, 2026 · Updated 2 months ago
- [ICLR 2026] Unified Vision-Language-Action Model ☆302 · Oct 15, 2025 · Updated 7 months ago
- [NeurIPS'24] NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction ☆127 · Sep 26, 2024 · Updated last year
- ☆32 · May 16, 2025 · Updated last year
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout ☆35 · May 13, 2026 · Updated 2 weeks ago
- Sim2real robot manipulation utilizing GS modeling ☆14 · Feb 19, 2025 · Updated last year
- ☆46 · Mar 26, 2025 · Updated last year
- Code of WinT3R: Window-Based Streaming Reconstruction With Camera Token Pool ☆228 · Mar 4, 2026 · Updated 2 months ago
- DeepVerse: 4D Autoregressive Video Generation as a World Model ☆231 · Aug 11, 2025 · Updated 9 months ago
- ☆158 · Oct 15, 2024 · Updated last year
- Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning ☆154 · Aug 1, 2025 · Updated 9 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving. ☆49 · Apr 22, 2026 · Updated last month
- [NIPS 2025] FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens ☆21 · Oct 12, 2025 · Updated 7 months ago
- Infinite-Forcing: Towards Infinite-Long Video Generation ☆151 · Nov 13, 2025 · Updated 6 months ago
- Official implementation of "From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction" ☆70 · Nov 25, 2025 · Updated 6 months ago
- LAP: Language-Action Pre-Training Enables Zero-Shot Cross Embodiment Transfer ☆138 · May 20, 2026 · Updated last week
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ☆301 · Jul 8, 2025 · Updated 10 months ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." ☆130 · Oct 23, 2025 · Updated 7 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling ☆595 · Oct 26, 2025 · Updated 7 months ago
- ☆19 · Jul 7, 2024 · Updated last year
- Author's implementation of DemoDiffusion. ☆66 · Jan 14, 2026 · Updated 4 months ago
- ☆99 · Sep 4, 2024 · Updated last year
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision ☆43 · Dec 2, 2025 · Updated 5 months ago
- ☆55 · Sep 21, 2025 · Updated 8 months ago
- Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025) ☆15 · May 2, 2025 · Updated last year
- Evaluation for 3D reconstruction, includes monocular depth, video depth, relative camera pose & multi-view point map estimation. ☆21 · Aug 26, 2025 · Updated 9 months ago
- [CoRL2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation ☆43 · Jun 4, 2024 · Updated last year