OpenMOSS / GAOKAO-MM
[ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation
☆37 · Updated 8 months ago
Related projects
Alternatives and complementary repositories for GAOKAO-MM
- Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal … ☆27 · Updated this week
- ☆32 · Updated 5 months ago
- A Survey on the Honesty of Large Language Models ☆46 · Updated last month
- The reinforcement learning codes for dataset SPA-VL ☆21 · Updated 4 months ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating ☆79 · Updated 9 months ago
- ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation ☆94 · Updated 4 months ago
- [EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Infe… ☆75 · Updated last week
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models ☆26 · Updated 4 months ago
- ☆25 · Updated last month
- The demo, code and data of FollowRAG ☆61 · Updated 3 weeks ago
- A RLHF Infrastructure for Vision-Language Models ☆106 · Updated last week
- ☆71 · Updated 10 months ago
- ☆21 · Updated last month
- 😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources. ☆146 · Updated 8 months ago
- ☆38 · Updated 5 months ago
- ☆116 · Updated 4 months ago
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback ☆235 · Updated 2 months ago
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models ☆21 · Updated 2 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ☆153 · Updated 9 months ago
- ☆24 · Updated 3 weeks ago
- Official github repo of G-LLaVA ☆122 · Updated 5 months ago
- A Self-Training Framework for Vision-Language Reasoning ☆18 · Updated last week
- An Easy-to-use Hallucination Detection Framework for LLMs. ☆48 · Updated 7 months ago
- Official repository of MMDU dataset ☆75 · Updated last month
- ☆65 · Updated 2 months ago
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria ☆55 · Updated last month
- Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?" ☆163 · Updated 2 months ago
- [SIGIR'24] The official implementation code of MOELoRA. ☆127 · Updated 4 months ago
- ☆147 · Updated 4 months ago
- ☆33 · Updated 4 months ago