SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards
☆39Jan 28, 2026Updated 4 months ago
Alternatives and similar repositories for SpatialThinker
Users that are interested in SpatialThinker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code implementation of the paper "World-in-World: World Models in a Closed-Loop World" (ICLR'26 Oral)☆168Apr 3, 2026Updated last month
- ☆55Apr 7, 2026Updated last month
- Spatial Aptitude Training for Multimodal Langauge Models☆32Feb 8, 2026Updated 3 months ago
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023☆12Sep 21, 2023Updated 2 years ago
- Training recipe for SpatialReasoner [NeurIPS 2025]☆45Apr 5, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation☆62Jan 5, 2026Updated 4 months ago
- Use deep learning to learn Koopman operator and LQR for optimal control☆18Sep 28, 2020Updated 5 years ago
- Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James)☆26Sep 1, 2023Updated 2 years ago
- A simple visual test-time scaling method for GUI agent grounding☆26Dec 7, 2025Updated 5 months ago
- Official repo for "All Roads Lead to Rome: Incentivizing Divergent Thinking in Vision-Language Models" (CVPR 2026)☆71Apr 18, 2026Updated last month
- Heatmap-based Out-of-Distribution Detection (WACV 2023)☆13Mar 27, 2024Updated 2 years ago
- An implementation for MLLM oversensitivity evaluation☆18Nov 16, 2024Updated last year
- ☆12Nov 22, 2022Updated 3 years ago
- EARL: Editing with Autoregression and RL☆42Nov 21, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆25Oct 3, 2023Updated 2 years ago
- ☆17Dec 14, 2022Updated 3 years ago
- ☆13Feb 28, 2025Updated last year
- ☆18Dec 5, 2017Updated 8 years ago
- LDR and HDR pair Dataset☆19Mar 3, 2021Updated 5 years ago
- [CVPR 2021] Labeled from Unlabeled: Exploiting Unlabeled Data for Few-shot Deep HDR Deghosting☆18Dec 24, 2021Updated 4 years ago
- This is the official PyTorch implementation for DiffHDR: Towards High-quality HDR Deghosting with Conditional Diffusion Models (TCSVT'202…☆29Feb 12, 2024Updated 2 years ago
- MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence☆57Mar 11, 2026Updated 2 months ago
- Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025☆16Dec 25, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆17Dec 6, 2023Updated 2 years ago
- REOBench: Benchmarking Robustness of Earth Observation Foundation Models☆24May 22, 2026Updated last week
- [ICRA 2026] StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes☆24Feb 17, 2026Updated 3 months ago
- WACV 2025: Theory, Experiments, Dataset, and Code for our newly proposed LDR → HDR Deep Learning Dataset called GTA-HDR☆32May 3, 2026Updated 3 weeks ago
- ☆29Sep 2, 2025Updated 8 months ago
- Official codebase for the CVPR 2026 paper "Self-Evolving 3D Scene Generation from a Single Image"☆20Dec 15, 2025Updated 5 months ago
- 同济大学软件学院2023年秋软件工程课程笔记☆17Jan 16, 2024Updated 2 years ago
- the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo…☆21Apr 9, 2025Updated last year
- Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024☆18Jul 11, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆16Oct 31, 2024Updated last year
- Official implementation of the paper “Endowing Vision-Language Models with System 2 Thinking for Fine-Grained Visual Recognition,” AAAI 2…☆38Jan 30, 2026Updated 4 months ago
- ☆10Jun 20, 2025Updated 11 months ago
- Anatomy-aware self-supervised learning☆11Jun 22, 2024Updated last year
- The official implementation of the paper DADF for industrial VAD☆13Dec 1, 2023Updated 2 years ago
- ☆12Dec 23, 2022Updated 3 years ago
- Official PyTorch implementation of the paper Transformer-Based Image Generation from Scene Graphs https://arxiv.org/abs/2303.04634☆19Jan 30, 2024Updated 2 years ago