The official repo for the paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)
★128 · Nov 15, 2025 · Updated 7 months ago
Alternatives and similar repositories for VQ-VLA
Users who are interested in VQ-VLA are comparing it to the repositories listed below.
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning ★91 · Oct 14, 2024 · Updated last year
- Point cloud 6DoF pose estimation via Central Voting PPF (C++ reproduction of TIP 2021 paper). ★13 · Nov 26, 2023 · Updated 2 years ago
- Open-source implementations on real robots ★35 · Nov 25, 2024 · Updated last year
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ★176 · Jun 19, 2025 · Updated last year
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning ★57 · Apr 1, 2025 · Updated last year
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025. ★108 · Oct 6, 2025 · Updated 8 months ago
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction ★44 · Aug 9, 2025 · Updated 10 months ago
- [CoRL 2025] Pretraining code for FLOWER VLA on OXE ★41 · Sep 22, 2025 · Updated 8 months ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling ★481 · Apr 16, 2026 · Updated 2 months ago
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me… ★24 · Aug 19, 2023 · Updated 2 years ago
- [NeurIPS 2025] Official implementation of BridgeVLA ★189 · Apr 5, 2026 · Updated 2 months ago
- [ICCV 2025] Dense Policy (DSP): Bidirectional Autoregressive Learning of Actions ★79 · Jan 14, 2026 · Updated 5 months ago
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data ★78 · Mar 6, 2026 · Updated 3 months ago
- [ICLR 2026] Unified Vision-Language-Action Model ★306 · Oct 15, 2025 · Updated 8 months ago
- [NeurIPS'24] NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction ★127 · Sep 26, 2024 · Updated last year
- ★32 · May 16, 2025 · Updated last year
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout ★36 · May 25, 2026 · Updated 3 weeks ago
- Sim2real robot manipulation utilizing GS modeling ★14 · Feb 19, 2025 · Updated last year
- ★46 · Mar 26, 2025 · Updated last year
- Code of WinT3R: Window-Based Streaming Reconstruction With Camera Token Pool ★228 · Mar 4, 2026 · Updated 3 months ago
- Official implementation for our paper Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance (CoRL 2024) ★54 · Apr 28, 2025 · Updated last year
- DeepVerse: 4D Autoregressive Video Generation as a World Model ★231 · Aug 11, 2025 · Updated 10 months ago
- ★160 · Oct 15, 2024 · Updated last year
- Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning ★155 · Aug 1, 2025 · Updated 10 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving. ★50 · Apr 22, 2026 · Updated last month
- [NIPS 2025] FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens ★21 · Oct 12, 2025 · Updated 8 months ago
- Infinite-Forcing: Towards Infinite-Long Video Generation ★153 · Nov 13, 2025 · Updated 7 months ago
- Official implementation of "From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction" ★72 · Nov 25, 2025 · Updated 6 months ago
- LAP: Language-Action Pre-Training Enables Zero-Shot Cross Embodiment Transfer ★147 · May 20, 2026 · Updated 3 weeks ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ★304 · Jul 8, 2025 · Updated 11 months ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." ★130 · May 26, 2026 · Updated 3 weeks ago
- ★19 · Jul 7, 2024 · Updated last year
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling ★595 · Oct 26, 2025 · Updated 7 months ago
- Author's implementation of DemoDiffusion. ★67 · Jan 14, 2026 · Updated 5 months ago
- ★100 · Sep 4, 2024 · Updated last year
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision ★46 · Dec 2, 2025 · Updated 6 months ago
- ★55 · Sep 21, 2025 · Updated 8 months ago
- Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025) ★15 · May 2, 2025 · Updated last year
- Evaluation for 3D reconstruction, includes monocular depth, video depth, relative camera pose & multi-view point map estimation. ★21 · Aug 26, 2025 · Updated 9 months ago