Official implementation for BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation
☆158Mar 2, 2026Updated 3 months ago
Alternatives and similar repositories for BitVLA
Users that are interested in BitVLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2026] Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆50Sep 15, 2025Updated 9 months ago
- [RSS 2025] Gripper Keypose and Object Pointflow as Interfaces for Bimanual Robotic Manipulation☆79Jul 22, 2025Updated 11 months ago
- FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation☆13Dec 13, 2024Updated last year
- ☆63Mar 3, 2026Updated 3 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆118Apr 14, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆68Sep 18, 2025Updated 9 months ago
- This is the official codebase for paper: Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Acti…☆55Apr 9, 2026Updated 2 months ago
- AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation☆38Jul 25, 2025Updated 11 months ago
- ☆21Dec 23, 2025Updated 6 months ago
- Code☆57Jun 6, 2026Updated 3 weeks ago
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces☆86Jun 6, 2025Updated last year
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆34Nov 2, 2025Updated 7 months ago
- ☆27Mar 6, 2025Updated last year
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout☆37May 25, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code and Data for Paper: Boosting Efficient Reinforcement Learning for Vision-and-Language Navigation With Open-Sourced LLM☆18Feb 7, 2025Updated last year
- ☆12Dec 4, 2024Updated last year
- SeSaMe TAMP + Learning integrated with a Spot robot!☆30Jun 12, 2026Updated 2 weeks ago
- ☆34Oct 27, 2024Updated last year
- ☆37Mar 8, 2026Updated 3 months ago
- A Bimanual-mobile Robot Manipulation Dataset specifically designed for household applications☆17Aug 12, 2024Updated last year
- Official code for "One-Shot Manipulation Strategy Learning by Making Contact Analogies".☆28Feb 7, 2025Updated last year
- [ICCV 2025] DyWA:Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation☆93May 12, 2026Updated last month
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Jun 22, 2026Updated last week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆46Mar 26, 2025Updated last year
- ☆13Jun 22, 2024Updated 2 years ago
- [CoRL 2025 Best Paper Award] Fabrica: Dual-Arm Assembly of General Multi-Part Objects via Integrated Planning and Learning☆90Jan 11, 2026Updated 5 months ago
- [CVPR 2026] Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interface☆82Mar 10, 2026Updated 3 months ago
- Granularity-Aware Affordance Understanding from human-object interaction for Dexterous Robotic Functional Grasping☆15Sep 2, 2025Updated 9 months ago
- ☆12Aug 18, 2023Updated 2 years ago
- Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning☆156Aug 1, 2025Updated 10 months ago
- [RSS 2026] LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion☆270May 26, 2026Updated last month
- [NeurIPS 2025] VLA-Cache: Efficient Vision-Language-Action Manipulation via Adaptive Token Caching☆89Feb 27, 2026Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆443Nov 11, 2025Updated 7 months ago
- [TPAMI2026 / RSS2025] Code for my paper "You Only Teach Once: Learn One-Shot Bimanual Robotic Manipulation from Video Demonstrations"☆146Jun 19, 2026Updated last week
- ☆23Jun 16, 2025Updated last year
- 🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.☆697Jun 23, 2025Updated last year
- ☆79Sep 19, 2025Updated 9 months ago
- Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io☆398May 17, 2025Updated last year
- Project page for Neural Shell Texture Splatting (ICCV 2025)☆35Oct 14, 2025Updated 8 months ago