apple / ml-space-benchmark
Code and data for "Does Spatial Cognition Emerge in Frontier Models?"
☆13Updated 3 weeks ago
Alternatives and similar repositories for ml-space-benchmark:
Users that are interested in ml-space-benchmark are comparing it to the libraries listed below
- ☆42Updated last year
- ☆10Updated last week
- Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"☆36Updated last year
- EgoTV Egocentric Task Verification from Natural Language Task Descriptions☆27Updated last year
- Evaluate Multimodal LLMs as Embodied Agents☆46Updated 2 months ago
- Official Code for Neural Systematic Binder☆32Updated 2 years ago
- Official implementation for CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding☆45Updated last year
- 🐍 A Python Package for Seamless Data Distribution in AI Workflows☆22Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆58Updated 7 months ago
- Language Repository for Long Video Understanding☆31Updated 10 months ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆57Updated 2 months ago
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".☆22Updated 6 months ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆36Updated 2 years ago
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆28Updated 9 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆85Updated last year
- ☆76Updated 8 months ago
- ☆16Updated last month
- ☆33Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 2 years ago
- Codebase for HiP☆89Updated last year
- Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"☆25Updated 7 months ago
- General-purpose Visual Understanding Evaluation☆20Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆20Updated last year
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆47Updated last year
- Official Repository of NeurIPS2021 paper: PTR☆33Updated 3 years ago
- Code for "Is CLIP ideal? No. Can we fix it? Yes!"☆15Updated 2 months ago
- Scaffold Prompting to promote LMMs☆40Updated 4 months ago
- A paper list of world model☆27Updated last month
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models☆108Updated last year
- Recursive Visual Programming (ECCV 2024)☆17Updated 5 months ago