Spatial Aptitude Training for Multimodal Language Models
☆24Feb 8, 2026Updated 3 weeks ago
Alternatives and similar repositories for SAT
Users who are interested in SAT are comparing it to the libraries listed below
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics☆22Nov 18, 2025Updated 3 months ago
- Benchmarking Multi-Image Understanding in Vision and Language Models☆12Jul 29, 2024Updated last year
- ☆46Feb 18, 2026Updated 2 weeks ago
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆17Feb 3, 2025Updated last year
- Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…☆21Jun 24, 2025Updated 8 months ago
- [ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…☆21Oct 24, 2024Updated last year
- [Awesome-Spatial-VLMs] This repository is the official, community-maintained resource for the survey paper: Spatial Intelligence in Visio…☆64Feb 16, 2026Updated 2 weeks ago
- Official implementation for the paper "Towards Understanding How Knowledge Evolves in Large Vision-Language Models"☆28Apr 10, 2025Updated 10 months ago
- Source code for EyeRobot☆41Dec 1, 2025Updated 3 months ago
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".☆27Jun 10, 2025Updated 8 months ago
- Code release for SceneReplica paper.☆29Jul 24, 2025Updated 7 months ago
- Code base for zero-shot action localization through spatial-aware object embeddings☆25Nov 3, 2017Updated 8 years ago
- STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?☆37Jan 12, 2026Updated last month
- A Vision-Language Model for Spatial Affordance Prediction in Robotics☆214Jul 17, 2025Updated 7 months ago
- Implementation of Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins. [RSS 2025]☆49Oct 21, 2025Updated 4 months ago
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆71Feb 28, 2024Updated 2 years ago
- Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories☆78Feb 17, 2026Updated 2 weeks ago
- ☆56Aug 7, 2025Updated 6 months ago
- (3DV 2026 Oral) L4P -- a feed-forward foundational model designed for multiple low-level 4D vision perception tasks.☆60Dec 9, 2025Updated 2 months ago
- ☆42Jul 9, 2025Updated 7 months ago
- A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation☆58Apr 1, 2025Updated 11 months ago
- Code Repository for ControlVLA, CoRL2025.☆85Oct 26, 2025Updated 4 months ago
- [ICLR'25] Official Implementation of STAMP: Scalable Task And Model-agnostic Collaborative Perception☆56Feb 4, 2025Updated last year
- TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics☆42Updated this week
- ☆11May 3, 2019Updated 6 years ago
- ☆20Sep 5, 2025Updated 5 months ago
- Scaffold Prompting to promote LMMs☆46Dec 16, 2024Updated last year
- ☆13Nov 5, 2024Updated last year
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- ☆20Aug 22, 2025Updated 6 months ago
- Multi-Agent LLM System for Digital Scam Protection☆12Dec 19, 2024Updated last year
- Third and final part of the series on numerical methods with Python☆11May 30, 2022Updated 3 years ago
- [ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning☆41Oct 10, 2024Updated last year
- ☆77Aug 29, 2025Updated 6 months ago
- Distributed, scalable benchmarking of generalist robot policies.☆84Feb 10, 2026Updated 3 weeks ago
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆20Feb 25, 2026Updated last week
- [NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocen…☆22Jun 17, 2025Updated 8 months ago
- ☆14Mar 5, 2024Updated 2 years ago