xiaoxiao0406 / VQ-VLA
The official repo for the paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025).
☆110 · Nov 15, 2025 · Updated 3 months ago
Alternatives and similar repositories for VQ-VLA
Users who are interested in VQ-VLA are comparing it to the repositories listed below.
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning ☆91 · Oct 14, 2024 · Updated last year
- ☆13 · Nov 26, 2023 · Updated 2 years ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning ☆56 · Apr 1, 2025 · Updated 10 months ago
- Open-source implementations on real robots ☆35 · Nov 25, 2024 · Updated last year
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆172 · Jun 19, 2025 · Updated 7 months ago
- [ICCV 2025] Dense Policy: Bidirectional Autoregressive Learning of Actions DSP ☆72 · Jan 14, 2026 · Updated last month
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me… ☆24 · Aug 19, 2023 · Updated 2 years ago
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction ☆44 · Aug 9, 2025 · Updated 6 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025. ☆101 · Oct 6, 2025 · Updated 4 months ago
- ☆19 · Jul 7, 2024 · Updated last year
- [CoRL 2025] Pretraining code for FLOWER VLA on OXE ☆29 · Sep 22, 2025 · Updated 4 months ago
- ☆143 · Oct 15, 2024 · Updated last year
- Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning ☆143 · Aug 1, 2025 · Updated 6 months ago
- ☆39 · Mar 26, 2025 · Updated 10 months ago
- Official PyTorch implementation for ICML 2025 paper: UP-VLA. ☆55 · Jan 20, 2026 · Updated 3 weeks ago
- ✨✨【NeurIPS 2025】Official implementation of BridgeVLA ☆170 · Sep 20, 2025 · Updated 4 months ago
- ☆33 · May 16, 2025 · Updated 9 months ago
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout ☆28 · Nov 21, 2025 · Updated 2 months ago
- Code for the paper "Robot Data Curation with Mutual Information Estimators" ☆28 · Apr 22, 2025 · Updated 9 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ☆279 · Jul 8, 2025 · Updated 7 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision ☆33 · Dec 2, 2025 · Updated 2 months ago
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts. ☆19 · Jan 6, 2026 · Updated last month
- F1: A Vision Language Action Model Bridging Understanding and Generation to Actions ☆160 · Jan 2, 2026 · Updated last month
- Author's implementation of DemoDiffusion. ☆61 · Jan 14, 2026 · Updated last month
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling ☆429 · Jan 7, 2026 · Updated last month
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." ☆123 · Oct 23, 2025 · Updated 3 months ago
- [CoRL 2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation ☆42 · Jun 4, 2024 · Updated last year
- [ICLR 2026] Unified Vision-Language-Action Model ☆274 · Oct 15, 2025 · Updated 4 months ago
- Official implementation of "From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction" ☆59 · Nov 25, 2025 · Updated 2 months ago
- Code for the paper "Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation" ☆100 · Jul 31, 2024 · Updated last year
- [NIPS 2025] FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens ☆20 · Oct 12, 2025 · Updated 4 months ago
- Just wanna see what type and how many GPUs/TPUs are used in CVPR 2025 oral papers. Fun vibe coding with LLMs. ☆12 · Apr 24, 2025 · Updated 9 months ago
- Autoregressive Policy for Robot Learning (RA-L 2025) ☆147 · Mar 25, 2025 · Updated 10 months ago
- [CoRL 2023] XSkill: cross embodiment skill discovery ☆66 · Mar 25, 2024 · Updated last year
- Official implementation of "Data Scaling Laws in Imitation Learning for Robotic Manipulation" ☆203 · Nov 13, 2024 · Updated last year
- DROID Policy Learning and Evaluation ☆267 · Apr 22, 2025 · Updated 9 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆79 · Dec 12, 2024 · Updated last year
- [RA-L] Lost & Found dynamically tracks object poses from egocentric videos while updating a scene graph, enabling richer semantic 3D unde… ☆55 · Sep 29, 2025 · Updated 4 months ago
- ☆78 · May 23, 2025 · Updated 8 months ago