ChenVoid / CombatVLALinks
[ICCV 2025] CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games
☆19Updated 3 months ago
Alternatives and similar repositories for CombatVLA
Users that are interested in CombatVLA are comparing it to the libraries listed below
Sorting:
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆190Updated 4 months ago
- ☆89Updated last month
- OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding☆59Updated 2 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆350Updated 3 months ago
- A paper list for spatial reasoning☆139Updated 3 months ago
- ☆30Updated 9 months ago
- Virtual Community: An Open World for Humans, Robots, and Society☆172Updated this week
- RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation☆208Updated this week
- Official code for MotionBench (CVPR 2025)☆58Updated 6 months ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆152Updated 4 months ago
- 📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.☆271Updated last week
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆59Updated 3 months ago
- [NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration☆82Updated 3 months ago
- [CVPR 2025] EgoLife: Towards Egocentric Life Assistant☆330Updated 6 months ago
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"☆40Updated last month
- Accepted by CVPR 2024☆38Updated last year
- ☆84Updated 2 months ago
- [World-Model-Survey-2024] Paper list and projects for World Model☆15Updated 10 months ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆186Updated 2 months ago
- VIKI‑R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning☆48Updated 3 weeks ago
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks☆168Updated 3 months ago
- Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)☆161Updated last month
- [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence☆52Updated last month
- [ICCV 2025] FonTS: Text Rendering with Typography and Style Controls☆27Updated last month
- InternVLA-M1: A Spatially Grounded Foundation Model for Generalist Robot Policy☆116Updated this week
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos☆133Updated 4 months ago
- TStar is a unified temporal search framework for long-form video question answering☆67Updated 3 weeks ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆155Updated 3 months ago
- [ICCV 2025] RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints☆82Updated 3 weeks ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆80Updated 2 months ago