Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
☆65Jan 1, 2026Updated 2 months ago
Alternatives and similar repositories for All-Angles-Bench
Users that are interested in All-Angles-Bench are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocen…☆22Jun 17, 2025Updated 8 months ago
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆22Feb 5, 2026Updated last month
- STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?☆38Jan 12, 2026Updated last month
- [ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…☆21Oct 24, 2024Updated last year
- CVPR 2025 (Highlight) : Official implementation of "Cross-View Completion Models are Zero-shot Correspondence Estimators"☆62Jun 23, 2025Updated 8 months ago
- The official implementation for "Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos".☆49May 23, 2025Updated 9 months ago
- Official Implementation of "Multi-Granularity Video Object Segmentation" (AAAI 2025)☆25Dec 20, 2024Updated last year
- Official Implementation of "Towards Open-Vocabulary Semantic Segmentation without Semantic Labels" (NeurIPS 2024)☆53Oct 7, 2024Updated last year
- Official implementation of "CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models"☆39Feb 24, 2026Updated last week
- ☆14Sep 11, 2025Updated 5 months ago
- [AAAI 2025] Official pytorch implementation of "Diffusion Model Patching via Mixture-of-Prompts"☆13Dec 12, 2024Updated last year
- ☆20Oct 15, 2025Updated 4 months ago
- [AAAI 2025] Official Implementation of I-HallA v1.0☆13Feb 2, 2025Updated last year
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"☆63Dec 18, 2025Updated 2 months ago
- [SIGGRAPH Asia 2025] The official repo for the conference paper "MV-Performer: Taming Video Diffusion Model for Faithful and Synchronized…☆39Dec 13, 2025Updated 2 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆35Feb 26, 2026Updated last week
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆67Jul 22, 2025Updated 7 months ago
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆82Jul 6, 2025Updated 8 months ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation☆23Nov 17, 2025Updated 3 months ago
- ☆14Oct 12, 2024Updated last year
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆30Oct 28, 2025Updated 4 months ago
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval☆13Sep 18, 2025Updated 5 months ago
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment☆36Oct 5, 2025Updated 5 months ago
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆70Jan 8, 2026Updated 2 months ago
- Official implementation of "Referring Video Object Segmentation via Language Aligned Track Selection".☆40Jun 2, 2025Updated 9 months ago
- A precise and stable CFG for negative prompts, derived via guided sampling with contrastive loss.☆14Dec 27, 2024Updated last year
- Official This-Is-My Dataset published in CVPR 2023☆16Jul 18, 2024Updated last year
- Research Papers on Efficient Neural Fields from EffL Group☆16Apr 21, 2025Updated 10 months ago
- Official repo and evaluation implementation of VSI-Bench☆675Aug 5, 2025Updated 7 months ago
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"☆25Jun 8, 2025Updated 9 months ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆40Feb 27, 2026Updated last week
- ☆16May 26, 2023Updated 2 years ago
- Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward☆60Nov 27, 2025Updated 3 months ago
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆229Feb 11, 2026Updated 3 weeks ago
- Official implementation of "Visual Persona: Foundation Model for Full-Body Human Customization" (CVPR 2025)☆45Feb 20, 2026Updated 2 weeks ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 4 months ago
- [NeurIPS'25] Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"☆95Dec 3, 2025Updated 3 months ago
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models☆19Mar 25, 2025Updated 11 months ago
- [NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆235Dec 16, 2025Updated 2 months ago