mathvision-cuhk / MATH-V

MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.

☆53

Related projects: ⓘ

yihedeng9 / STIC
Enhancing Large Vision Language Models with Self-Training on Image Comprehension.
☆51Updated 3 months ago
HZQ950419 / Math-LLaVA
Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
☆52Updated 2 months ago
vlf-silkie / VLFeedback
☆73Updated 8 months ago
junyangwang0410 / AMBER
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
☆85Updated 8 months ago
FudanDISC / ReForm-Eval
An benchmark for evaluating the capabilities of large vision-language models (LVLMs)
☆32Updated 10 months ago
AoiDragon / POPE
[EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''
☆67Updated 5 months ago
Dongping-Chen / MLLM-Judge
[ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.
☆47Updated last month
FreedomIntelligence / MLLM-Bench
MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria
☆49Updated last month
yfzhang114 / LLaVA-Align
This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…
☆66Updated 5 months ago
TideDra / VL-RLHF
A RLHF Infrastructure for Vision-Language Models
☆86Updated 3 months ago
TIGER-AI-Lab / Mantis
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
☆158Updated last week
tianyi-lab / HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…
☆220Updated 6 months ago
pengts / VW-LMM
☆20Updated 4 months ago
FuxiaoLiu / MMC
[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning
☆75Updated last month
deepcs233 / Visual-CoT
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
☆93Updated 2 months ago
RUCAIBox / POPE
The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''
☆166Updated 5 months ago
OpenGVLab / MM-NIAH
This is the official implementation of the paper "Needle In A Multimodal Haystack"
☆72Updated 2 months ago
thunlp / Muffin
☆53Updated 7 months ago
DAMO-NLP-SG / VCD
[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
☆177Updated 2 months ago
opendatalab / HA-DPO
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
☆58Updated 7 months ago
OpenGVLab / ChartAst
ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.
☆101Updated last week
wangclnlp / Vision-LLM-Alignment
This repo contains the codes for supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) designed for vision L…
☆39Updated 2 weeks ago
42Shawn / LLaVA-PruMerge
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
☆86Updated 4 months ago
lzhxmu / VTW
☆20Updated last month
yaolinli / DeCo
☆11Updated 2 months ago
waltonfuture / InstructionGPT-4
InstructionGPT-4
☆35Updated 8 months ago
princeton-nlp / CharXiv
CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
☆66Updated last month
OpenGVLab / MMT-Bench
ICML'2024 | MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
☆84Updated 2 months ago
OpenKG-ORG / EasyDetect
An Easy-to-use Hallucination Detection Framework for LLMs.
☆48Updated 4 months ago
YiyangZhou / LURE
[ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
☆128Updated 4 months ago