alibaba-damo-academy / EOCBench
[NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocentric scenarios.
☆22Jun 17, 2025Updated 8 months ago
Alternatives and similar repositories for EOCBench
Users who are interested in EOCBench are comparing it to the repositories listed below.
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆30Oct 28, 2025Updated 3 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆62Jan 1, 2026Updated last month
- STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?☆36Jan 12, 2026Updated last month
- ☆14Jul 11, 2024Updated last year
- SIBench: a project on visual spatial reasoning tasks☆25Jan 12, 2026Updated last month
- [ICCV 2025] Official PyTorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆49Mar 20, 2025Updated 10 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆46Aug 13, 2025Updated 6 months ago
- [ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation☆11Aug 13, 2024Updated last year
- [ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model"☆10Sep 28, 2024Updated last year
- ☆15Sep 11, 2025Updated 5 months ago
- Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025☆15Dec 25, 2025Updated last month
- [CVPR 2025] GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector☆16Mar 19, 2025Updated 10 months ago
- [CHI24] AI-Assisted In-Context Writing on OHMD During Travels☆11Dec 19, 2024Updated last year
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics☆21Nov 18, 2025Updated 2 months ago
- ☆20Oct 15, 2025Updated 4 months ago
- ☆14Nov 23, 2024Updated last year
- Implementation of VLM4VLA☆116Feb 2, 2026Updated 2 weeks ago
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆127Jan 30, 2026Updated 2 weeks ago
- PlaneRecTR: Unified Query Learning for 3D Plane Recovery from a Single View☆48Sep 11, 2024Updated last year
- ☆14Jul 14, 2023Updated 2 years ago
- ICCV 2019 Tutorial: Global Optimization for Geometric Understanding with Provable Guarantees☆15Oct 20, 2022Updated 3 years ago
- Official implementation of the paper "Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance" (WACV 2025)☆16Mar 5, 2025Updated 11 months ago
- ☆21Sep 16, 2025Updated 5 months ago
- Benchmarking Multi-Image Understanding in Vision and Language Models☆12Jul 29, 2024Updated last year
- ☆21Apr 5, 2025Updated 10 months ago
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆19Feb 5, 2026Updated last week
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps☆25Jan 22, 2026Updated 3 weeks ago
- Companion website for the article Learning Long-Range Perception Using Self-Supervision from Short-Range Sensors and Odometry☆12Sep 27, 2022Updated 3 years ago
- PErception and Robotic Learning System v2☆12Mar 31, 2023Updated 2 years ago
- ☆33May 29, 2025Updated 8 months ago
- ☆12Feb 23, 2024Updated last year
- Repo for running various baselines with Behavior-1K☆33Nov 7, 2025Updated 3 months ago
- Implementation for the PHM paper at ICLR'21☆13Mar 1, 2023Updated 2 years ago
- Marching squares algorithm in Python☆11Jul 21, 2022Updated 3 years ago
- This is the source code for our ICLR 2025 work EqNIO☆23Apr 25, 2025Updated 9 months ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆14Sep 30, 2023Updated 2 years ago
- ☆14Jul 23, 2024Updated last year
- ☆22Jun 5, 2025Updated 8 months ago
- A multimodal UAV assistant dataset.☆11Jun 14, 2021Updated 4 years ago