ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models
☆70Jan 8, 2026Updated last month
Alternatives and similar repositories for ViewSpatial-Bench
Users that are interested in ViewSpatial-Bench are comparing it to the libraries listed below
Sorting:
- ☆36Oct 9, 2025Updated 4 months ago
- Benchmarking agent reasoning capabilities in physical interactions, tool usage, and multi-agent coordination.☆42Aug 10, 2025Updated 6 months ago
- ☆32Aug 11, 2025Updated 6 months ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆39Sep 30, 2025Updated 5 months ago
- ☆25Aug 19, 2025Updated 6 months ago
- [ACM MM 2025] SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139☆75Nov 10, 2025Updated 3 months ago
- [CVPR 2026] SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence☆63Jul 9, 2025Updated 7 months ago
- Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)☆70May 2, 2025Updated 9 months ago
- ☆79Nov 5, 2024Updated last year
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆306Feb 11, 2026Updated 2 weeks ago
- [CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval…☆31Apr 16, 2025Updated 10 months ago
- STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?☆37Jan 12, 2026Updated last month
- ☆30Nov 18, 2025Updated 3 months ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆104Jul 9, 2025Updated 7 months ago
- ☆41Jun 9, 2025Updated 8 months ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 5 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆46Sep 8, 2025Updated 5 months ago
- Official Implementation of Rethinking Score Distillation as a Bridge Between Image Distributions☆85Mar 26, 2025Updated 11 months ago
- 基于DPO算法微调语言大模型,简单好上手。☆50Jul 3, 2024Updated last year
- A Python script to delete all comment and submission data from a given Reddit account.☆11Jan 5, 2021Updated 5 years ago
- my final work in NLP class☆13Dec 22, 2024Updated last year
- Reimplementation of NeRF (Neural Radiance Fields) (ECCV2020)☆10May 4, 2023Updated 2 years ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- [ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence☆78Updated this week
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆63Jan 1, 2026Updated 2 months ago
- 深度学习领域论文翻译+理解☆17Feb 25, 2022Updated 4 years ago
- Collection of Machine Learning examples using MLEK CMSIS-pack.☆10Feb 18, 2026Updated last week
- ☆13Jul 22, 2022Updated 3 years ago
- A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".☆12Jan 16, 2023Updated 3 years ago
- ☆13Feb 3, 2026Updated 3 weeks ago
- [ICML 2024] Fine-Grained Classes and How to Find Them☆13Jun 21, 2024Updated last year
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement☆17Jan 5, 2026Updated last month
- Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals☆12May 24, 2024Updated last year
- Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025☆15Dec 25, 2025Updated 2 months ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- 一个 Windows 端高效 3D 重建平台,集成 OpenMVG 和 OpenMVS,覆盖从稀疏点云到完整贴图 Mesh 的完整流程;基于 Qt5,自动可视化重建,界面直观,操作简单。☆21Jan 16, 2025Updated last year
- The officalimplement of dLLM-Factory☆26Jul 12, 2025Updated 7 months ago
- [AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding☆303Feb 2, 2026Updated 3 weeks ago
- ☆18Aug 7, 2025Updated 6 months ago