ustcwhy / BitVLA
Official implementation for BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation
☆54 · Updated last week
Alternatives and similar repositories for BitVLA
Users interested in BitVLA are comparing it to the repositories listed below.
- Nvidia GEAR Lab's initiative to solve the robotics data problem using world models ☆205 · Updated 2 weeks ago
- NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks ☆146 · Updated last week
- Unified Vision-Language-Action Model ☆128 · Updated 2 weeks ago
- WorldVLA: Towards Autoregressive Action World Model ☆268 · Updated last week
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models … ☆64 · Updated last month
- Paper list in the survey: A Survey on Vision-Language-Action Models: An Action Tokenization Perspective ☆110 · Updated 2 weeks ago
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning ☆68 · Updated 2 months ago
- [ICML'25] The PyTorch implementation of the paper "AdaWorld: Learning Adaptable World Models with Latent Actions" ☆125 · Updated last month
- Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models ☆133 · Updated last month
- Virtual Community: An Open World for Humans, Robots, and Society ☆142 · Updated 2 weeks ago
- ☆76 · Updated last month
- [ICML 2024] A Touch, Vision, and Language Dataset for Multimodal Alignment ☆78 · Updated last month
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, … ☆156 · Updated 2 months ago
- Official code of the paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution" ☆99 · Updated 5 months ago
- A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation ☆33 · Updated 3 months ago
- [ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy ☆214 · Updated 3 months ago
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces ☆75 · Updated last month
- Official repository of "RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation" ☆100 · Updated last month
- Repository of the paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation` ☆128 · Updated 6 months ago
- Unifying 2D and 3D Vision-Language Understanding ☆95 · Updated 3 months ago
- ☆75 · Updated 10 months ago
- PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability ☆18 · Updated 3 months ago
- Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets ☆93 · Updated last month
- Official implementation for the project RUKA: Rethinking the Design of Humanoid Hands with Learning. Project website: https://ruka-hand.g… ☆102 · Updated this week
- Improving 3D Large Language Model via Robust Instruction Tuning ☆60 · Updated 4 months ago
- AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World ☆75 · Updated last month
- Distributed, scalable benchmarking of generalist robot policies ☆35 · Updated 3 weeks ago
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks ☆146 · Updated last month
- Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics" ☆96 · Updated last week
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning" ☆147 · Updated last month