Fangjun-Li / SpatialLM-StepGame
Codes and data for AAAI-24 paper "Advancing Spatial Reasoning in Large Language Models: An In-depth Evaluation and Enhancement Using the StepGame Benchmark"
☆13 Updated last year
Alternatives and similar repositories for SpatialLM-StepGame
Users who are interested in SpatialLM-StepGame are comparing it to the libraries listed below
- ☆57 Updated 7 months ago
- ☆83 Updated 3 months ago
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning ☆86 Updated last year
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective ☆32 Updated 4 months ago
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25) ☆75 Updated last month
- ☆21 Updated 7 months ago
- Official Repo for FoodieQA paper (EMNLP 2024) ☆16 Updated 7 months ago
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based causal reasoning works, including papers, codes and datasets. ☆66 Updated 4 months ago
- [NeurIPS 2024] Repos for "Visualization-of-Thought" dataset, construction code and evaluation. ☆30 Updated 8 months ago
- [ICLR 2025] VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning ☆58 Updated 4 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆32 Updated last year
- ☆17 Updated 11 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ☆71 Updated 4 months ago
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning". ☆54 Updated last year
- ☆38 Updated 11 months ago
- Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts ☆16 Updated last year
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?" ☆34 Updated 5 months ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202… ☆30 Updated last month
- The official implementation of SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning ☆15 Updated last month
- Self-training LLaVA for medical ☆16 Updated 7 months ago
- ☆24 Updated 2 months ago
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning ☆16 Updated 11 months ago
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding ☆61 Updated 2 weeks ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆68 Updated last year
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering ☆60 Updated 7 months ago
- ☆27 Updated 5 months ago
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models ☆29 Updated 7 months ago
- ☆17 Updated 5 months ago
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding ☆35 Updated last month
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models" ☆132 Updated last year