FatemehShiri/Spatial-MM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FatemehShiri/Spatial-MM)

FatemehShiri / Spatial-MM

☆12

Alternatives and similar repositories for Spatial-MM

Users that are interested in Spatial-MM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cambridgeltl / visual-spatial-reasoning
View on GitHub
[TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.
☆149Mar 25, 2023Updated 3 years ago
smthemex / ComfyUI_CustomNet
View on GitHub
A CustomNet node for ComfyUI
☆10Aug 11, 2024Updated last year
boyazeng / understand_bias
View on GitHub
Code release for "Understanding Bias in Large-Scale Visual Datasets"
☆25Dec 4, 2024Updated last year
xUhEngwAng / pinyin
View on GitHub
这个仓库包含了我在上人工智能课时完成的拼音输入法作业。
☆11Feb 16, 2022Updated 4 years ago
tegg89 / Deep-blogs
View on GitHub
A curated lists of self-taught materials including research blogs
☆16Dec 12, 2016Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
CraftJarvis / OmniJARVIS
View on GitHub
☆31Jun 25, 2024Updated 2 years ago
mishajw / repeng
View on GitHub
Experiments with representation engineering
☆14Feb 28, 2024Updated 2 years ago
tamangmilan / llama3
View on GitHub
Building Llama 3 from scratch using PyTorch
☆13Sep 1, 2024Updated last year
zhishuifeiqian / VCR-Bench
View on GitHub
VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning
☆37May 9, 2026Updated 2 months ago
alga-hopf / dl-spectral-graph-partitioning
View on GitHub
Deep learning and spectral embedding for graph partitioning
☆14May 13, 2022Updated 4 years ago
nickjiang2378 / vlm-hallucinations
View on GitHub
[ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"
☆105Nov 30, 2025Updated 7 months ago
steven-ccq / VisualReasoner
View on GitHub
[EMNLP 2024] Official repository for paper "From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis"
☆22Oct 15, 2024Updated last year
kivancgunduz / expiration-date-detection
View on GitHub
An API that detect expiration date from the product package's picture based on Deep Learning Algorithms
☆11Jun 4, 2022Updated 4 years ago
kcshum / pose-conditioned-NeRF-object-fusion
View on GitHub
Official Github repository for paper "Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates"
☆14Mar 22, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
BetterBench / Academic-paper-classification
View on GitHub
text classification compitioin top 10 strategy
☆18Aug 14, 2021Updated 4 years ago
THUNLP-MT / ActiView
View on GitHub
☆11Dec 20, 2024Updated last year
kolchinski / cs236
View on GitHub
☆14Dec 17, 2018Updated 7 years ago
claudia-viaro / Wdss-UCLdss_research
View on GitHub
☆12Aug 31, 2022Updated 3 years ago
clemneo / llava-interp
View on GitHub
☆86Nov 5, 2024Updated last year
rabiulcste / vismin
View on GitHub
[NeurIPS24] VisMin: Visual Minimal-Change Understanding
☆19Mar 3, 2025Updated last year
locuslab / llava-token-compression
View on GitHub
☆47Nov 8, 2024Updated last year
Fangjun-Li / SpatialLM-StepGame
View on GitHub
Codes and data for AAAI-24 paper "Advancing Spatial Reasoning in Large Language Models: An In-depth Evaluation and Enhancement Using the …
☆14Apr 23, 2024Updated 2 years ago
mengxiangming / dmps
View on GitHub
dmps code
☆41Jan 24, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ZurichRain / HMCGR
View on GitHub
code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"
☆10Oct 20, 2022Updated 3 years ago
Cryolite / mjai
View on GitHub
Standardization Project for mjai Format Specification
☆14Aug 28, 2024Updated last year
AshleyHan / SmileGNN
View on GitHub
☆14Jun 22, 2022Updated 4 years ago
metpallyv / DecisionTrees
View on GitHub
Goal of this project is to build Classification Decision Trees and Regression Decision trees without using any Machine learning libraries
☆10Dec 28, 2018Updated 7 years ago
Passerby-D / nonebot_plugin_note
View on GitHub
☆12Jan 31, 2023Updated 3 years ago
rhodesvic / ComputerNetwork-ATopDownApproach
View on GitHub
Computer Network : A Top-Down Approach 8th Resource and Homework
☆15Apr 23, 2021Updated 5 years ago
TJ-CSCCG / TongjiThesis-env
View on GitHub
TongjiThesis Docker 环境 | Docker environment for TongjiThesis (Tongji University thesis LaTeX template)
☆12Mar 28, 2026Updated 3 months ago
amitakamath / whatsup_vlms
View on GitHub
Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".
☆71Feb 28, 2024Updated 2 years ago
TIGER-AI-Lab / VisualWebInstruct
View on GitHub
The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25]
☆39Feb 1, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Wangpeiyi9979 / HCL-Text2AMR
View on GitHub
Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"
☆13Jun 1, 2022Updated 4 years ago
NichtsHsu / japanese_mahjong_theory
View on GitHub
日麻牌理分析
☆11Feb 9, 2026Updated 5 months ago
haolunc / iGSM-Replication-physics-LLM
View on GitHub
This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.
☆17Sep 13, 2024Updated last year
EGO4D / ego-exo4d-egopose
View on GitHub
☆18Apr 16, 2024Updated 2 years ago
jedota / Synthetic_ID-Card_Image
View on GitHub
☆16Mar 4, 2025Updated last year
PositionalHidden / PositionalHidden
View on GitHub
To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …
☆12Jun 18, 2024Updated 2 years ago
apartresearch / mechanisticinterpretability
View on GitHub
A repository for awesome resources in mechanistic interpretability
☆16Jan 18, 2023Updated 3 years ago