☆12Jan 10, 2025Updated last year
Alternatives and similar repositories for Spatial-MM
Users that are interested in Spatial-MM are comparing it to the libraries listed below
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆22Dec 4, 2024Updated last year
- [TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.☆139Mar 25, 2023Updated 2 years ago
- ☆79Nov 5, 2024Updated last year
- ☆30Jun 25, 2024Updated last year
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning☆35Jul 15, 2025Updated 7 months ago
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"☆10Oct 20, 2022Updated 3 years ago
- ☆96Jan 27, 2026Updated last month
- dmps code☆39Jan 24, 2024Updated 2 years ago
- Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)☆71Jun 26, 2025Updated 8 months ago
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"☆95Nov 30, 2025Updated 3 months ago
- Image reconstruction from human brain activity by VAE and adversarial learning☆12May 21, 2022Updated 3 years ago
- A large-scale training and benchmarking framework for rPPG.☆10Nov 26, 2024Updated last year
- Goal of this project is to build Classification Decision Trees and Regression Decision trees without using any Machine learning libraries☆10Dec 28, 2018Updated 7 years ago
- ☆47Nov 8, 2024Updated last year
- [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models☆67Dec 1, 2025Updated 3 months ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆313Dec 14, 2024Updated last year
- Official code for "Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model"☆12Oct 29, 2022Updated 3 years ago
- ☆12Dec 20, 2024Updated last year
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- ☆11Dec 13, 2023Updated 2 years ago
- Library for automatic time series forecasting based on ARIMA models☆12May 14, 2017Updated 8 years ago
- ☆10Oct 17, 2022Updated 3 years ago
- [WACV 2025] High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer☆21Jan 14, 2026Updated last month
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 3 years ago
- A collection of papers tackling automatic fact-checking (particularly of AI-generated content)☆14Nov 3, 2023Updated 2 years ago
- NightSurveillance Dataset for Pedestrian Detection☆11Jul 30, 2020Updated 5 years ago
- A digital twin of the city of Chicago along with automated sensors☆12Nov 14, 2019Updated 6 years ago
- The good practice in the VQA system such as pos-tag attention, structured triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- ICCV'23 | Adverse Weather Removal with Codebook Priors☆10Aug 28, 2023Updated 2 years ago
- ☆13May 16, 2024Updated last year
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Jun 11, 2023Updated 2 years ago
- ☆10May 4, 2018Updated 7 years ago
- ☆10Oct 24, 2022Updated 3 years ago
- ☆13Dec 22, 2021Updated 4 years ago
- An experiment with movie scenes and contrastive learning☆11Feb 1, 2025Updated last year
- MXNet-Gluon model to Caffe (support SSD in gluoncv)☆10Jun 20, 2019Updated 6 years ago
- Implementation of PPO for CartPole-v1☆10Jan 1, 2019Updated 7 years ago
- ☆11Apr 10, 2024Updated last year