albertwy / GPT-4V-EvaluationLinks

Data for evaluating GPT-4V

☆11

Alternatives and similar repositories for GPT-4V-Evaluation

Users that are interested in GPT-4V-Evaluation are comparing it to the libraries listed below

Sorting:

ChartMimic / ChartMimic
[ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation
☆125Updated 5 months ago
RenShuhuai-Andy / my-tools
my commonly-used tools
☆63Updated 10 months ago
ZhangYiqun018 / StickerConv
☆59Updated last year
LightChen233 / M3CoT
☆84Updated last year
Aman-4-Real / awesome-multimodal-dialogue
Paper, dataset and code list for multimodal dialogue.
☆22Updated 10 months ago
HaozheZhao / MIC_tool
☆14Updated 2 years ago
Aman-4-Real / MMTG
[ACM MM 2022]: Multi-Modal Experience Inspired AI Creation
☆21Updated 11 months ago
xieyuquanxx / awesome-Large-MultiModal-Hallucination
😎 curated list of awesome LMM hallucinations papers, methods & resources.
☆150Updated last year
PLUM-Lab / MultiInstruct
MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning
☆133Updated 2 years ago
lancopku / label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
☆165Updated last year
luka-group / mDPO
[EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.
☆83Updated last year
victorsungo / MMDialog
The official site of paper MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation
☆202Updated 2 years ago
HanNight / soft_self_consistency
Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents"
☆25Updated last year
yushuiwx / Mixture-of-LoRA-Experts
☆58Updated 11 months ago
NUSTM / LLMs-Waver-In-Judgments
☆12Updated last year
SihengLi99 / LLM-Honesty-Survey
[2025-TMLR] A Survey on the Honesty of Large Language Models
☆62Updated 11 months ago
pkunlp-icler / IKE
☆25Updated 2 years ago
IndexFziQ / Diffusion4NLP-Papers
A paper list about diffusion models for natural language processing.
☆182Updated 2 years ago
pldlgb / nuggets
☆86Updated last year
patrick-tssn / Awesome-Colorful-LLM
Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…
☆123Updated 5 months ago
RUCAIBox / CARP
☆17Updated 2 years ago
JetRunner / SuperICL
Code for "Small Models are Valuable Plug-ins for Large Language Models"
☆131Updated 2 years ago
X-PLUG / mPLUG-HalOwl
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
☆98Updated last year
Skytliang / COT-Reading-List
☆27Updated 2 years ago
OpenMOSS / Say-I-Dont-Know
[ICML'2024] Can AI Assistants Know What They Don't Know?
☆83Updated last year
ZhangYiqun018 / Multimodel-Dialog
自己阅读的多模态对话系统论文（及部分笔记）汇总
☆23Updated 2 years ago
FudanDISC / ReForm-Eval
An benchmark for evaluating the capabilities of large vision-language models (LVLMs)
☆45Updated 2 years ago
vlf-silkie / VLFeedback
☆100Updated last year
BenfengXu / KNNPrompting
Released code for our ICLR23 paper.
☆66Updated 2 years ago
RUCAIBox / POPE
The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''
☆226Updated 3 months ago