[AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
☆88 Jan 11, 2026 Updated last month
Alternatives and similar repositories for CronusVLA
Users who are interested in CronusVLA are comparing it to the repositories listed below
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation ☆102 Jan 27, 2026 Updated last month
- [ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence ☆78 Updated this week
- [RSS 2025] Gripper Keypose and Object Pointflow as Interfaces for Bimanual Robotic Manipulation ☆76 Jul 22, 2025 Updated 7 months ago
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025) ☆38 May 25, 2025 Updated 9 months ago
- [NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding ☆71 Sep 29, 2025 Updated 5 months ago
- A versatile, all-in-one toolbox for whole-body humanoid robot control. ☆171 Oct 10, 2025 Updated 4 months ago
- InternRobotics' open-source toolbox for vision-based embodied spatial intelligence. ☆47 Sep 18, 2025 Updated 5 months ago
- An All-in-one robot manipulation learning suite for policy models training and evaluation on various datasets and benchmarks. ☆169 Oct 15, 2025 Updated 4 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ☆280 Jul 8, 2025 Updated 7 months ago
- Official implementation of "Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation" ☆133 Sep 18, 2025 Updated 5 months ago
- [ICRA 2026] Official implementation of "Towards Adaptable Humanoid Control via Adaptive Motion Tracking" ☆201 Oct 17, 2025 Updated 4 months ago
- A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation ☆58 Apr 1, 2025 Updated 11 months ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation ☆166 Feb 22, 2026 Updated last week
- [CVPR 2025] Official implementation of "GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation" ☆144 Jan 15, 2026 Updated last month
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy ☆374 Feb 11, 2026 Updated 2 weeks ago
- InternRobotics' open platform for building generalized navigation foundation models. ☆688 Feb 11, 2026 Updated 2 weeks ago
- ROS wrapper of Nvidia Contact-graspnet model. ☆17 Jul 3, 2023 Updated 2 years ago
- [CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI ☆652 Jun 13, 2025 Updated 8 months ago
- Twisting Lids Off with Two Hands [CoRL 2024] ☆38 Mar 16, 2025 Updated 11 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆45 Oct 29, 2023 Updated 2 years ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts. ☆226 Oct 17, 2025 Updated 4 months ago
- [CoRL25] GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data ☆341 Dec 29, 2025 Updated 2 months ago
- Official code for "From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation" ☆30 Jul 10, 2025 Updated 7 months ago
- Vision-Language-Action Optimization with Trajectory Ensemble Voting ☆25 Feb 18, 2026 Updated last week
- Official Code for SGRv2 and SGR. ☆33 May 20, 2025 Updated 9 months ago
- ☆62 Dec 14, 2024 Updated last year
- TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning ☆30 Jan 26, 2023 Updated 3 years ago
- ☆14 Feb 13, 2025 Updated last year
- ☆68 Jan 8, 2025 Updated last year
- [NeurIPS 2022] ASPiRe: Adaptive Skill Priors for Reinforcement Learning ☆13 Oct 19, 2022 Updated 3 years ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction ☆115 Apr 14, 2025 Updated 10 months ago
- Code for paper on ICRA 2022 workshop on Deformable Object Manipulation. In this work we learn keypoints from synthetic data for robotic c… ☆15 Aug 6, 2024 Updated last year
- Code for the paper "3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation" ☆32 Aug 18, 2025 Updated 6 months ago
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning" ☆208 May 30, 2025 Updated 9 months ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo… ☆980 Dec 20, 2025 Updated 2 months ago
- This is the repo of CoRL 2024 paper "Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning" ☆83 Dec 13, 2024 Updated last year
- This is the source code to paper “DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation”. ☆30 Aug 13, 2025 Updated 6 months ago
- [RSS 2025 Best Systems Paper Finalist] 💐Official implementation of "Learning Humanoid Standing-up Control across Diverse Postures" ☆520 Jun 17, 2025 Updated 8 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning ☆56 Apr 1, 2025 Updated 11 months ago