ning-mz / SCA-GPS
Code of ACM MM 2023 Paper: A Symbolic Characters Aware Model for Solving Geometry Problems
☆16 · Updated 2 years ago
Alternatives and similar repositories for SCA-GPS
Users interested in SCA-GPS are comparing it to the libraries listed below.
- ☆88 · Updated last year
- ☆40 · Updated last year
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain ☆106 · Updated last year
- [arXiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning ☆91 · Updated last year
- [AAAI 2025] Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning ☆42 · Updated 9 months ago
- Official Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning" ☆62 · Updated 2 months ago
- GitHub repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025) ☆68 · Updated 9 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models ☆92 · Updated last year
- ☆133 · Updated last year
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024) ☆34 · Updated last year
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024) ☆57 · Updated last year
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models ☆155 · Updated last year
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight) ☆84 · Updated last month
- MMICL, a state-of-the-art VLM with in-context learning (ICL) ability, from PKU ☆50 · Updated 6 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆53 · Updated 4 months ago
- Official Code of IdealGPT ☆35 · Updated 2 years ago
- A benchmark for evaluating the capabilities of large vision-language models (LVLMs) ☆46 · Updated 2 years ago
- Visualizing the attention of vision-language models ☆279 · Updated 11 months ago
- Code for our paper "All in an Aggregated Image for In-Image Learning" ☆29 · Updated last year
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities ☆128 · Updated 8 months ago
- A Self-Training Framework for Vision-Language Reasoning ☆88 · Updated last year
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models ☆32 · Updated last year
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts ☆354 · Updated 4 months ago
- An RLHF Infrastructure for Vision-Language Models ☆196 · Updated last year
- Code and data for the ACL 2024 paper "Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space" ☆18 · Updated last year
- [ICLR 2025] ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation ☆131 · Updated last month
- ☆101 · Updated 2 years ago
- ☆51 · Updated last year
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model ☆47 · Updated last year
- [NeurIPS 2025] The official implementation of SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning ☆25 · Updated 4 months ago