OpenMOSS / GAOKAO-MM
[ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation
☆53 Updated last year
Alternatives and similar repositories for GAOKAO-MM:
Users who are interested in GAOKAO-MM are comparing it to the repositories listed below.
- ☆64 Updated 9 months ago
- [AAAI 2025] Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning ☆30 Updated 5 months ago
- ☆40 Updated 8 months ago
- ☆80 Updated last year
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models ☆26 Updated last month
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee… ☆38 Updated 7 months ago
- ☆26 Updated 4 months ago
- Paper collections of multi-modal LLM for Math/STEM/Code. ☆80 Updated 2 weeks ago
- ☆26 Updated 4 months ago
- The demo, code and data of FollowRAG ☆70 Updated 3 months ago
- ☆57 Updated last month
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆79 Updated last year
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi… ☆102 Updated 4 months ago
- Official repository of MMDU dataset ☆86 Updated 5 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ☆161 Updated last year
- The official repository of the Omni-MATH benchmark. ☆74 Updated 2 months ago
- ☆49 Updated 4 months ago
- A Self-Training Framework for Vision-Language Reasoning ☆69 Updated last month
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆95 Updated 2 weeks ago
- ☆30 Updated 5 months ago
- [NeurIPS DB Track, 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities. ☆89 Updated this week
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning". ☆115 Updated 4 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations ☆70 Updated 7 months ago
- Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal … ☆44 Updated 2 weeks ago
- ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation ☆101 Updated 7 months ago
- ☆30 Updated 2 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆54 Updated 7 months ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating ☆90 Updated last year