fscdc / ReasonMapLinks
[arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps
☆68Updated 2 weeks ago
Alternatives and similar repositories for ReasonMap
Users that are interested in ReasonMap are comparing it to the libraries listed below
Sorting:
- ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO☆74Updated 5 months ago
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆76Updated 3 months ago
- ☆41Updated 5 months ago
- ☆18Updated 5 months ago
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Updated last year
- A paper list for spatial reasoning☆157Updated last week
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆59Updated last year
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆24Updated 5 months ago
- [AAAI 2024] Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-Supervised 3D Object Detection☆10Updated 9 months ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆115Updated 7 months ago
- ☆30Updated this week
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆93Updated 4 months ago
- ☆53Updated 5 months ago
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT☆96Updated last week
- STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?☆30Updated 4 months ago
- ☆30Updated 3 weeks ago