ning-mz / SCA-GPSLinks
Code of ACM MM 2023 Paper: A Symbolic Characters Aware Model for Solving Geometry Problems
☆16Updated 2 years ago
Alternatives and similar repositories for SCA-GPS
Users that are interested in SCA-GPS are comparing it to the libraries listed below
Sorting:
- ☆88Updated last year
- [AAAI 2025]Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning☆42Updated 9 months ago
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆106Updated last year
- ☆39Updated last year
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆130Updated last month
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆90Updated last year
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆84Updated last month
- A RLHF Infrastructure for Vision-Language Models☆193Updated last year
- An benchmark for evaluating the capabilities of large vision-language models (LVLMs)☆46Updated 2 years ago
- ☆51Updated last year
- ☆102Updated 2 years ago
- Official Code of IdealGPT☆35Updated 2 years ago
- A Self-Training Framework for Vision-Language Reasoning☆88Updated last year
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models☆156Updated last year
- ☆133Updated last year
- Official github repo of G-LLaVA☆148Updated 11 months ago
- [ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1…☆16Updated 6 months ago
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts☆353Updated 4 months ago
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆61Updated 2 months ago
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆50Updated 6 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆53Updated 4 months ago
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.☆128Updated 8 months ago
- [ACL 2025] A Neural-Symbolic Self-Training Framework☆117Updated 8 months ago
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization☆100Updated 2 years ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆98Updated 2 years ago
- Official Repository of "Learning what reinforcement learning can't"☆79Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆134Updated 10 months ago
- ☆110Updated last year
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆64Updated last year
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆29Updated last year