Chenyu-Wang567/All-Angles-Bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Chenyu-Wang567/All-Angles-Bench)

Chenyu-Wang567 / All-Angles-Bench

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs

☆69

Alternatives and similar repositories for All-Angles-Bench

Users that are interested in All-Angles-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

InternRobotics / MMSI-Bench
View on GitHub
[ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
☆103Apr 28, 2026Updated 2 months ago
Dwawayu / Pensieve
View on GitHub
The official implementation for "Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos".
☆49May 23, 2025Updated last year
beacon-3d / Beacon3D
View on GitHub
[CVPR 2025] Beacon3D: Object-centric Evaluation for 3D Grounding-QA
☆28Nov 25, 2025Updated 7 months ago
cvlab-kaist / SOLA
View on GitHub
Official implementation of "Referring Video Object Segmentation via Language Aligned Track Selection".
☆41Jun 2, 2025Updated last year
MINT-SJTU / STI-Bench
View on GitHub
STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
☆39Jan 12, 2026Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OuyangKun10 / SpaceR
View on GitHub
SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning
☆111Jul 9, 2025Updated last year
neu-vi / struct2d
View on GitHub
Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)
☆31Oct 28, 2025Updated 8 months ago
cvlab-kaist / MUG-VOS
View on GitHub
Official Implementation of "Multi-Granularity Video Object Segmentation" (AAAI 2025)
☆25Dec 20, 2024Updated last year
Haochen-Wang409 / ross3d
View on GitHub
[ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
☆70Jul 22, 2025Updated 11 months ago
vision-x-nyu / thinking-in-space
View on GitHub
Official repo and evaluation implementation of VSI-Bench
☆732Aug 5, 2025Updated 11 months ago
danielchyeh / this-is-my
View on GitHub
Official This-Is-My Dataset published in CVPR 2023
☆16Jul 18, 2024Updated 2 years ago
mll-lab-nu / MindCube
View on GitHub
☆163Mar 23, 2026Updated 3 months ago
KAIST-Visual-AI-Group / APC-VLM
View on GitHub
[ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
☆66Sep 12, 2025Updated 10 months ago
cvlab-kaist / SpikeMatch
View on GitHub
☆19Sep 29, 2025Updated 9 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ZJU-REAL / ViewSpatial-Bench
View on GitHub
[ECCV 2026] ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models
☆82Mar 9, 2026Updated 4 months ago
cvlab-kaist / ReNoV
View on GitHub
☆21Feb 14, 2026Updated 5 months ago
SceneCOT / scenecot
View on GitHub
[ICLR 2026] SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes
☆27Mar 22, 2026Updated 3 months ago
sled-group / COMFORT
View on GitHub
[ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…
☆22Oct 24, 2024Updated last year
LaVi-Lab / VG-LLM
View on GitHub
The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'
☆245Nov 28, 2025Updated 7 months ago
LogosRoboticsGroup / SPAR
View on GitHub
From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…
☆90Jan 5, 2026Updated 6 months ago
cvlab-kaist / PixelCLIP
View on GitHub
Official Implementation of "Towards Open-Vocabulary Semantic Segmentation without Semantic Labels" (NeurIPS 2024)
☆57Oct 7, 2024Updated last year
VITA-Group / VLM-3R
View on GitHub
[CVPR 2026] VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
☆428Updated this week
LaVi-Lab / Video-3D-LLM
View on GitHub
[CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.
☆218Jun 4, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
hyungjin-chung / VPS
View on GitHub
☆16Sep 11, 2025Updated 10 months ago
Visual-AI / 3DRS
View on GitHub
[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding
☆158Dec 9, 2025Updated 7 months ago
cvlab-kaist / Vid-CamEdit
View on GitHub
Official Implementation of "Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry"
☆32Nov 10, 2025Updated 8 months ago
Zhoues / RoboRefer
View on GitHub
[NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"
☆263Dec 16, 2025Updated 7 months ago
mll-lab-nu / Theory-of-Space
View on GitHub
THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…
☆85Feb 27, 2026Updated 4 months ago
ttchengab / continuous_3d_words_code
View on GitHub
☆66Jun 27, 2024Updated 2 years ago
cvlab-kaist / ZeroCo
View on GitHub
CVPR 2025 (Highlight) : Official implementation of "Cross-View Completion Models are Zero-shot Correspondence Estimators"
☆70Jun 23, 2025Updated last year
KAIST-Visual-AI-Group / VG-AVS
View on GitHub
Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection
☆24Feb 5, 2026Updated 5 months ago
cvlab-kaist / Visual-Persona
View on GitHub
Official implementation of "Visual Persona: Foundation Model for Full-Body Human Customization" (CVPR 2025)
☆48Feb 20, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sangminwoo / DMP
View on GitHub
[AAAI 2025] Official pytorch implementation of "Diffusion Model Patching via Mixture-of-Prompts"
☆13Dec 12, 2024Updated last year
byeongjun-park / SteerX
View on GitHub
[ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"
☆50Mar 20, 2025Updated last year
cvlab-kaist / URECA
View on GitHub
Official implementation of "URECA : Unique Region Caption Anything"
☆58Jul 13, 2025Updated last year
cvlab-kaist / InterRVOS
View on GitHub
Official implementation of "InterRVOS: Interaction-aware Referring Video Object Segmentation".
☆31May 1, 2026Updated 2 months ago
WanyueZhang-ai / spatial-understanding
View on GitHub
☆19Sep 3, 2025Updated 10 months ago
zoezheng126 / Spatio-Temporal-LLM
View on GitHub
☆19Aug 7, 2025Updated 11 months ago
transductive-visualprogram / tvp
View on GitHub
☆15Jan 7, 2026Updated 6 months ago