siyuhsu / vla-cache
Official implementation of the paper "VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation"
☆13 Updated last week
Alternatives and similar repositories for vla-cache
Users who are interested in vla-cache are comparing it to the libraries listed below.
- [Arxiv 2025: MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation] ☆36 Updated 2 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning ☆36 Updated 2 months ago
- This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works. ☆133 Updated this week
- [ICML 2025] Rethinking Latent Redundancy in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation ☆23 Updated last month
- RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation ☆27 Updated last month
- ☆95 Updated last month
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution" ☆95 Updated 4 months ago
- A comprehensive list of papers about dual-system VLA models, including papers, codes, and related websites. ☆48 Updated 3 weeks ago
- ☆25 Updated last year
- Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning. ☆131 Updated last month
- ☆49 Updated last week
- ☆78 Updated 3 weeks ago
- ☆54 Updated 4 months ago
- 🦾 A Dual-System VLA with System2 Thinking ☆38 Updated this week
- Official PyTorch implementation for NeurIPS 2024 paper: Prediction with Action. ☆28 Updated 5 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction ☆83 Updated 2 months ago
- This is the official repo for [CoRL 2024] Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation ☆28 Updated 7 months ago
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning" ☆106 Updated 3 weeks ago
- ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation ☆45 Updated 2 months ago
- ☆41 Updated 8 months ago
- ☆63 Updated 4 months ago
- Interactive Post-Training for Vision-Language-Action Models ☆75 Updated 3 weeks ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆67 Updated 6 months ago
- ☆40 Updated 6 months ago
- ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation ☆112 Updated last month
- Dense Policy: Bidirectional Autoregressive Learning of Actions ☆41 Updated 3 months ago
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025) ☆17 Updated last month
- SDP ☆61 Updated 8 months ago
- [ICRA 2025] CAGE: Causal Attention Enables Data-Efficient Generalizable Robotic Manipulation ☆27 Updated 5 months ago
- Official code for: Observe Then Act: Asynchronous Active Vision-Action Model for Robotic Manipulation (OTA) ☆17 Updated 5 months ago