dochouyi / SUCCLinks
☆11Updated last year
Alternatives and similar repositories for SUCC
Users that are interested in SUCC are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆228Updated last month
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆286Updated last month
- Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.☆524Updated last year
- Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"☆20Updated 4 months ago
- Official code for the CVPR 2025 paper "Navigation World Models".☆528Updated 2 months ago
- [World-Model-Survey-2024] Paper list and projects for World Model☆15Updated last year
- 😎 A curated list of CVPR 2025 Oral paper. Total 96☆60Updated 2 months ago
- [CVPR2024] This is the official implement of MP5☆106Updated last year
- [NeurIPS-2024] The offical Implementation of "Instruction-Guided Visual Masking"☆41Updated last year
- [ICLR 2026] Unified Vision-Language-Action Model☆274Updated 3 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆381Updated 3 months ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆262Updated 3 months ago
- Efficiently apply modification functions to RLDS/TFDS datasets.☆29Updated last year
- [NIPS 2025] Open-World Drone Active Tracking with Goal-Centered Rewards☆17Updated 3 months ago
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆336Updated 4 months ago
- [TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"☆416Updated 10 months ago
- This is the completion of google's rt-1 project code and can run directly.☆37Updated last year
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy☆363Updated last month
- [NeurIPS 2025] VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation☆65Updated 4 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆335Updated 4 months ago
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation☆405Updated 3 months ago
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆229Updated last year
- Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)☆98Updated 8 months ago
- ☆457Updated last week
- [ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models☆238Updated last year
- A curated list of large VLM-based VLA models for robotic manipulation.☆339Updated last month
- [Actively Maintained🔥] A list of Embodied AI papers accepted by top conferences (ICLR, NeurIPS, ICML, RSS, CoRL, ICRA, IROS, CVPR, ICCV,…☆472Updated 2 months ago
- Thinking in 360°: Humanoid Visual Search in the Wild☆115Updated 2 weeks ago
- Latest Advances on Embodied Multimodal LLMs (or Vison-Language-Action Models).☆121Updated last year
- [NeurIPS 2025 Spotlight] Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning.☆118Updated last month