ChenVoid / CombatVLA
[ICCV 2025] CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games
☆16 Updated 2 months ago
Alternatives and similar repositories for CombatVLA
Users who are interested in CombatVLA are comparing it to the repositories listed below.
- ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models ☆58 Updated 3 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, … ☆187 Updated 3 months ago
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks ☆163 Updated 3 months ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents. ☆179 Updated last month
- A paper list for spatial reasoning ☆134 Updated 2 months ago
- [World-Model-Survey-2024] Paper list and projects for World Model ☆15 Updated 10 months ago
- ☆80 Updated last month
- OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding ☆59 Updated last month
- [CVPR 2025] EgoLife: Towards Egocentric Life Assistant ☆322 Updated 5 months ago
- TStar is a unified temporal search framework for long-form video question answering ☆63 Updated last week
- RynnVLA-001: A Vision-Language-Action Model Boosted by Generative Priors ☆153 Updated 3 weeks ago
- ☆85 Updated last month
- Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025) ☆151 Updated last month
- [ICCV 2025] FonTS: Text Rendering with Typography and Style Controls ☆23 Updated last week
- Pixel-Level Reasoning Model trained with RL ☆201 Updated this week
- ☆30 Updated 8 months ago
- Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks ☆26 Updated 3 weeks ago
- Visual Planning: Let's Think Only with Images ☆269 Updated 3 months ago
- VIKI‑R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning ☆39 Updated this week
- [CVPR 2024] This is the official implementation of MP5 ☆103 Updated last year
- [ICCV 2025] RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints ☆72 Updated last week
- Official repo and evaluation implementation of VSI-Bench ☆583 Updated 3 weeks ago
- Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration ☆77 Updated 3 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence ☆328 Updated 2 months ago
- ☆23 Updated last week
- Official code for MotionBench (CVPR 2025) ☆55 Updated 6 months ago
- The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning". ☆136 Updated last month
- [ICLR 2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs ☆58 Updated 6 months ago
- This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024) ☆249 Updated 9 months ago
- Official repo of VLABench, a large-scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. ☆281 Updated 3 weeks ago