☆12Jan 10, 2025Updated last year
Alternatives and similar repositories for Spatial-MM
Users that are interested in Spatial-MM are comparing it to the libraries listed below.
- [TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.☆143Mar 25, 2023Updated 3 years ago
- A CustomNet node for ComfyUI☆10Aug 11, 2024Updated last year
- This repository contains the pinyin input method assignment I completed for my artificial intelligence course.☆11Feb 16, 2022Updated 4 years ago
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning☆36Jul 15, 2025Updated 9 months ago
- ☆83Nov 5, 2024Updated last year
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆23Dec 4, 2024Updated last year
- [ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…☆21Oct 24, 2024Updated last year
- [CVPR 2026 Findings] This repo is the official implementation of "Euclid’s Gift: Enhancing Spatial Perception and Reasoning in Vision‑La…☆28Mar 15, 2026Updated last month
- A curated list of self-study materials, including research blogs☆16Dec 12, 2016Updated 9 years ago
- ☆14Dec 17, 2018Updated 7 years ago
- ☆31Jun 25, 2024Updated last year
- Experiments with representation engineering☆14Feb 28, 2024Updated 2 years ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆317Dec 14, 2024Updated last year
- Computer Networking: A Top-Down Approach, 8th edition: resources and homework☆15Apr 23, 2021Updated 4 years ago
- Deep learning and spectral embedding for graph partitioning☆14May 13, 2022Updated 3 years ago
- [EMNLP 2024] Official repository for paper "From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis"☆21Oct 15, 2024Updated last year
- Official Github repository for paper "Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates"☆14Mar 22, 2024Updated 2 years ago
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"☆102Nov 30, 2025Updated 4 months ago
- An API that detect expiration date from the product package's picture based on Deep Learning Algorithms☆12Jun 4, 2022Updated 3 years ago
- Good practices in VQA systems, such as POS-tag attention, structured triplet learning, and triplet attention, are very general and can be…☆19Jan 23, 2018Updated 8 years ago
- Text classification competition top-10 strategy☆18Aug 14, 2021Updated 4 years ago
- Building Llama 3 from scratch using PyTorch☆13Sep 1, 2024Updated last year
- ☆12Dec 20, 2024Updated last year
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆110Jul 9, 2025Updated 9 months ago
- ☆102Jan 27, 2026Updated 2 months ago
- Standardization Project for mjai Format Specification☆12Aug 28, 2024Updated last year
- [NeurIPS24] VisMin: Visual Minimal-Change Understanding☆19Mar 3, 2025Updated last year
- ☆12Jan 31, 2023Updated 3 years ago
- ☆47Nov 8, 2024Updated last year
- Codes and data for AAAI-24 paper "Advancing Spatial Reasoning in Large Language Models: An In-depth Evaluation and Enhancement Using the …☆14Apr 23, 2024Updated last year
- Official Pytorch Code Implementation for "UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control", accepted by ICML 2…☆32Jan 30, 2026Updated 2 months ago
- Python script that splits videos into individual frames.☆11Dec 21, 2021Updated 4 years ago
- ☆14Jun 22, 2022Updated 3 years ago
- Docker environment for TongjiThesis (Tongji University thesis LaTeX template)☆12Mar 28, 2026Updated 2 weeks ago
- Japanese mahjong tile efficiency analysis☆11Feb 9, 2026Updated 2 months ago
- dmps code☆39Jan 24, 2024Updated 2 years ago
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆71Feb 28, 2024Updated 2 years ago
- This repository contains a replication of the iGSM dataset generation process from the Physics of Language Models paper by Zeyuan Allen-Zhu.☆17Sep 13, 2024Updated last year
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs☆14Apr 19, 2025Updated 11 months ago