Fangjun-Li / SpatialLM-StepGame

Code and data for the AAAI-24 paper "Advancing Spatial Reasoning in Large Language Models: An In-depth Evaluation and Enhancement Using the StepGame Benchmark"

☆13

Alternatives and similar repositories for SpatialLM-StepGame

Users who are interested in SpatialLM-StepGame compare it to the repositories listed below.

clemneo / llava-interp
☆57 Updated 7 months ago
IntelLabs / lvlm-interpret
☆83 Updated 3 months ago
YiyangZhou / POVID
[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning
☆86 Updated last year
Lingkai-Kong / RE-Control
Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective
☆32 Updated 4 months ago
nickjiang2378 / vl-interp
Official PyTorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25)
☆75 Updated last month
OpenCausaLab / CELLO
☆21 Updated 7 months ago
lyan62 / FoodieQA
Official Repo for FoodieQA paper (EMNLP 2024)
☆16 Updated 7 months ago
chendl02 / Awesome-LLM-Causal-Reasoning
[NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based causal reasoning works, including papers, code, and datasets.
☆66 Updated 4 months ago
microsoft / visualization-of-thought
[NeurIPS 2024] Repo for the "Visualization-of-Thought" dataset, construction code, and evaluation.
☆30 Updated 8 months ago
ys-zong / VL-ICL
[ICLR 2025] VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning
☆58 Updated 4 months ago
fangyuan-ksgk / CoT-Reasoning-without-Prompting
Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting
☆32 Updated last year
MediaBrain-SJTU / MoLA
☆17 Updated 11 months ago
Dongping-Chen / MLLM-Judge
[ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.
☆71 Updated 4 months ago
amitakamath / whatsup_vlms
Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".
☆54 Updated last year
behavioral-data / TSandLanguage
☆38 Updated 11 months ago
YuejiangLIU / csl
Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts
☆16 Updated last year
tmlr-group / NoisyRationales
[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"
☆34 Updated 5 months ago
yuhui-zh15 / AutoConverter
Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…
☆30 Updated last month
chen-judge / SPC
The official implementation of SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
☆15 Updated last month
heliossun / STLLaVA-Med
Self-training LLaVA for medical
☆16 Updated 7 months ago
Zayne-sprague / To-CoT-or-not-to-CoT
☆24 Updated 2 months ago
MajorDavidZhang / MCL
Code for "Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning"
☆16 Updated 11 months ago
thunlp / DeepPerception
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
☆61 Updated 2 weeks ago
yihedeng9 / STIC
Enhancing Large Vision Language Models with Self-Training on Image Comprehension.
☆68 Updated last year
shengliu66 / VTI
Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering
☆60 Updated 7 months ago
stellalisy / mediQ
☆27 Updated 5 months ago
sled-group / moh
[NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models
☆29 Updated 7 months ago
kingdy2002 / SPA
☆17 Updated 5 months ago
zeyofu / ReFocus_Code
Code for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding
☆35 Updated last month
chancharikmitra / CCoT
[CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"
☆132 Updated last year