[AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
☆91 · Jan 11, 2026 · Updated 2 months ago
Alternatives and similar repositories for CronusVLA
Users who are interested in CronusVLA are comparing it to the repositories listed below
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation ☆108 · Jan 27, 2026 · Updated last month
- [ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence ☆80 · Mar 13, 2026 · Updated last week
- InternRobotics' open-source toolbox for vision-based embodied spatial intelligence. ☆48 · Sep 18, 2025 · Updated 6 months ago
- [NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding ☆73 · Sep 29, 2025 · Updated 5 months ago
- [RSS 2025] Gripper Keypose and Object Pointflow as Interfaces for Bimanual Robotic Manipulation ☆78 · Jul 22, 2025 · Updated 7 months ago
- A versatile, all-in-one toolbox for whole-body humanoid robot control. ☆176 · Oct 10, 2025 · Updated 5 months ago
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025) ☆43 · May 25, 2025 · Updated 9 months ago
- An All-in-one robot manipulation learning suite for policy models training and evaluation on various datasets and benchmarks. ☆169 · Oct 15, 2025 · Updated 5 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ☆283 · Jul 8, 2025 · Updated 8 months ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation ☆169 · Feb 22, 2026 · Updated 3 weeks ago
- [ICRA 2026] Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation ☆135 · Updated this week
- [ICRA 2026] Official implementation of "Towards Adaptable Humanoid Control via Adaptive Motion Tracking" ☆203 · Oct 17, 2025 · Updated 5 months ago
- [CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI ☆658 · Jun 13, 2025 · Updated 9 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts. ☆231 · Oct 17, 2025 · Updated 5 months ago
- InternRobotics' open platform for building generalized navigation foundation models. ☆732 · Mar 10, 2026 · Updated last week
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy ☆383 · Feb 11, 2026 · Updated last month
- ☆63 · Dec 14, 2024 · Updated last year
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆46 · Oct 29, 2023 · Updated 2 years ago
- [RSS 2025 Best Systems Paper Finalist] 💐Official implementation of "Learning Humanoid Standing-up Control across Diverse Postures" ☆546 · Jun 17, 2025 · Updated 9 months ago
- a brief repo about paper research ☆15 · Sep 4, 2024 · Updated last year
- A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation ☆61 · Apr 1, 2025 · Updated 11 months ago
- Official implementation of the paper: "NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance" ☆551 · Jan 12, 2026 · Updated 2 months ago
- ☆41 · Mar 19, 2025 · Updated last year
- A simulation platform for versatile Embodied AI research and developments. ☆1,219 · Sep 4, 2025 · Updated 6 months ago
- [CoRL25] GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data ☆349 · Dec 29, 2025 · Updated 2 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling ☆580 · Oct 26, 2025 · Updated 4 months ago
- Official implementation of EgoHOD at ICLR 2025; 14 EgoVis Challenge Winners in CVPR 2024 ☆32 · Nov 25, 2025 · Updated 3 months ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo… ☆1,005 · Dec 20, 2025 · Updated 3 months ago
- Official code for "From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation" (ICLR2026) ☆31 · Mar 1, 2026 · Updated 2 weeks ago
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation ☆45 · Jun 20, 2025 · Updated 9 months ago
- Open-sourced code for "HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit". ☆533 · Sep 1, 2025 · Updated 6 months ago
- Twisting Lids Off with Two Hands [CoRL 2024] ☆39 · Mar 16, 2025 · Updated last year
- ☆69 · Jan 8, 2025 · Updated last year
- This is the repo of CoRL 2024 paper "Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning" ☆84 · Dec 13, 2024 · Updated last year
- ☆21 · Oct 31, 2024 · Updated last year
- [CVPR 2025] Horizon-GS: Unified 3D Gaussian Splatting for Large-Scale Aerial-to-Ground Scenes ☆129 · Jul 5, 2025 · Updated 8 months ago
- [ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large Language Models to Understand Point Clouds ☆984 · Updated this week
- Code&Data for Grounded 3D-LLM with Referent Tokens ☆134 · Jan 5, 2025 · Updated last year
- [ICRA 2026] Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling" ☆426 · Nov 2, 2025 · Updated 4 months ago