Causal video-action world model for generalist robot control
☆827Feb 27, 2026Updated 3 weeks ago
Alternatives and similar repositories for lingbot-va
Users that are interested in lingbot-va are comparing it to the libraries listed below
Sorting:
- Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals☆1,290Updated this week
- A Pragmatic VLA Foundation Model☆941Mar 12, 2026Updated last week
- ☆216Jan 31, 2026Updated last month
- ☆57Jan 25, 2026Updated last month
- RoboTwin 2.0 Offical Repo☆2,053Updated this week
- This is the official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy".☆332Nov 11, 2025Updated 4 months ago
- RynnVLA-002: A Unified Vision-Language-Action and World Model☆947Dec 2, 2025Updated 3 months ago
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆135Jun 10, 2025Updated 9 months ago
- Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the …☆920Mar 3, 2026Updated 2 weeks ago
- Source code of DreamDojo by the NVIDIA GEAR Team.☆625Mar 4, 2026Updated 2 weeks ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆283Jul 8, 2025Updated 8 months ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆180Jun 20, 2025Updated 9 months ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆485Jan 22, 2025Updated last year
- ☆10,755Updated this week
- RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation☆1,650Jan 21, 2026Updated 2 months ago
- Masked Depth Modeling for Spatial Perception☆946Feb 14, 2026Updated last month
- Galaxea's open-source VLA repository☆546Feb 14, 2026Updated last month
- Source materials for CoinFT☆29Jan 23, 2026Updated last month
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆350Jul 23, 2025Updated 7 months ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation.☆5,542Mar 23, 2025Updated 11 months ago
- Cosmos Policy☆649Jan 23, 2026Updated last month
- Spirit-v1.5: A Robotic Foundation Model by Spirit AI☆536Jan 14, 2026Updated 2 months ago
- ☆41Mar 19, 2025Updated last year
- Official implementation of Chain-of-Action: Trajectory Autoregressive Modeling for Robotic Manipulation. Accepted in NeurIPS 2025.☆99Dec 13, 2025Updated 3 months ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆1,005Dec 20, 2025Updated 3 months ago
- A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (…☆2,780Updated this week
- 🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images☆1,210Aug 27, 2025Updated 6 months ago
- Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets☆198Oct 8, 2025Updated 5 months ago
- Cameras as Relative Positional Encoding☆690Dec 18, 2025Updated 3 months ago
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations☆1,294Oct 17, 2025Updated 5 months ago
- Official code of RDT 2☆740Feb 7, 2026Updated last month
- 🎁 A collection of utilities for LeRobot.☆914Updated this week
- Theia: Distilling Diverse Vision Foundation Models for Robot Learning☆270Nov 6, 2025Updated 4 months ago
- ICCV 2025 | TesserAct: Learning 4D Embodied World Models☆384Aug 4, 2025Updated 7 months ago
- Tensor's VLA Training Infrastructure for Real-World Robotics in PyTorch☆119Mar 13, 2026Updated last week
- [RSS 2025] Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation☆308Feb 22, 2026Updated last month
- Reasoning in Space via Grounding in the World (ICLR 2025)☆50Nov 3, 2025Updated 4 months ago
- Benchmarking Knowledge Transfer in Lifelong Robot Learning☆1,611Mar 15, 2025Updated last year
- [CVPR 2026] "E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.☆271Feb 24, 2026Updated 3 weeks ago