The official repo for the paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)
☆119 · Nov 15, 2025 · Updated 5 months ago
Alternatives and similar repositories for VQ-VLA
Users that are interested in VQ-VLA are comparing it to the libraries listed below.
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning ☆91 · Oct 14, 2024 · Updated last year
- Point cloud 6DoF pose estimation via Central Voting PPF (C++ reproduction of TIP 2021 paper). ☆13 · Nov 26, 2023 · Updated 2 years ago
- Open-source implementations on real robots ☆35 · Nov 25, 2024 · Updated last year
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆175 · Jun 19, 2025 · Updated 10 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning ☆57 · Apr 1, 2025 · Updated last year
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025. ☆107 · Oct 6, 2025 · Updated 7 months ago
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction ☆44 · Aug 9, 2025 · Updated 9 months ago
- [CoRL 2025] Pretraining code for FLOWER VLA on OXE ☆38 · Sep 22, 2025 · Updated 7 months ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling ☆467 · Apr 16, 2026 · Updated 3 weeks ago
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me… ☆24 · Aug 19, 2023 · Updated 2 years ago
- ✨✨【NeurIPS 2025】Official implementation of BridgeVLA ☆186 · Apr 5, 2026 · Updated last month
- [ICCV 2025] Dense Policy (DSP): Bidirectional Autoregressive Learning of Actions ☆78 · Jan 14, 2026 · Updated 3 months ago
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data ☆67 · Mar 6, 2026 · Updated 2 months ago
- ☆72 · Apr 28, 2026 · Updated last week
- [ICLR 2026] Unified Vision-Language-Action Model ☆295 · Oct 15, 2025 · Updated 6 months ago
- [NeurIPS'24] NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction ☆126 · Sep 26, 2024 · Updated last year
- ☆32 · May 16, 2025 · Updated 11 months ago
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout ☆32 · Nov 21, 2025 · Updated 5 months ago
- Code of WinT3R: Window-Based Streaming Reconstruction With Camera Token Pool ☆225 · Mar 4, 2026 · Updated 2 months ago
- Sim2real robot manipulation utilizing GS modeling ☆14 · Feb 19, 2025 · Updated last year
- ☆46 · Mar 26, 2025 · Updated last year
- DeepVerse: 4D Autoregressive Video Generation as a World Model ☆226 · Aug 11, 2025 · Updated 8 months ago
- ☆156 · Oct 15, 2024 · Updated last year
- Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning ☆147 · Aug 1, 2025 · Updated 9 months ago
- LAP: Language-Action Pre-Training Enables Zero-Shot Cross Embodiment Transfer ☆126 · Apr 26, 2026 · Updated last week
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving. ☆45 · Apr 22, 2026 · Updated 2 weeks ago
- [NIPS 2025] FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens ☆21 · Oct 12, 2025 · Updated 6 months ago
- Infinite-Forcing: Towards Infinite-Long Video Generation ☆149 · Nov 13, 2025 · Updated 5 months ago
- Official implementation of "From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction" ☆66 · Nov 25, 2025 · Updated 5 months ago
- Author's implementation of DemoDiffusion. ☆64 · Jan 14, 2026 · Updated 3 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ☆298 · Jul 8, 2025 · Updated 10 months ago
- [RSS 2026] LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion ☆193 · Apr 29, 2026 · Updated last week
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." ☆129 · Oct 23, 2025 · Updated 6 months ago
- ☆19 · Jul 7, 2024 · Updated last year
- ☆97 · Sep 4, 2024 · Updated last year
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision ☆42 · Dec 2, 2025 · Updated 5 months ago
- ☆55 · Sep 21, 2025 · Updated 7 months ago
- Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025) ☆15 · May 2, 2025 · Updated last year
- Evaluation for 3D reconstruction, includes monocular depth, video depth, relative camera pose & multi-view point map estimation. ☆20 · Aug 26, 2025 · Updated 8 months ago