Wu-Zongyu / CharmBenchLinks

A preview-version of one novel multimodal reasoning benchmark CharmBench.

☆23

Alternatives and similar repositories for CharmBench

Users that are interested in CharmBench are comparing it to the libraries listed below

Sorting:

mm-vl / ULM-R1
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
☆23Updated last month
jungao1106 / ICoT
[CVPR' 25] Interleaved-Modal Chain-of-Thought
☆80Updated 2 weeks ago
Purshow / Awesome-LVLM-Hallucination
☆49Updated 9 months ago
GuanchengWan / awesome-ai-ml-papers-auto
🔥 【Meta Awesome List】: AI/ML Research Hub - Solving the "Chasing Hot Topics" Problem for AI Researchers. 🤖 Agent-driven intelligence au…
☆42Updated this week
WayneJin0918 / SOTA-paper-rating.io
A tiny paper rating web
☆39Updated 5 months ago
arctanxarc / MC-LLaVA
Official implementation of MC-LLaVA.
☆139Updated 2 weeks ago
yanghlll / ScalingNoise
☆39Updated 5 months ago
NOVAglow646 / LLM-MLLM-paper-list
关于LLM和Multimodal LLM的paper list
☆43Updated last week
arctanxarc / UniCTokens
A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…
☆121Updated 3 weeks ago
Purshow / Awesome-Unified-Multimodal
📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.
☆281Updated 3 weeks ago
ML-GSAI / Diffusion-LLM-Papers
A Collection of Papers on Diffusion Language Models
☆119Updated last week
LINs-lab / awesome_papers
☆20Updated 3 months ago
SAIS-FUXI / EvalAlign
☆18Updated 10 months ago
BarretBa / ICTHP
Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation
☆28Updated last month
chengzu-li / MVoT
Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)
☆42Updated 4 months ago
NOVAglow646 / OOD-Generalization-Paper-Reading-Notes
OOD Generalization相关文章的阅读笔记
☆31Updated 8 months ago
Wang-Xiaodong1899 / CVPR25-MLLM-Paper-List
🔥CVPR 2025 Multimodal Large Language Models Paper List
☆153Updated 5 months ago
Cooperx521 / ScaleCap
Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’
☆53Updated 2 months ago
xinyan-cxy / MINT-CoT
☆67Updated last month
Dongping-Chen / ISG
(ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.
☆27Updated 3 weeks ago
PKU-YuanGroup / WISE
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
☆144Updated 3 weeks ago
vlm2-bench / VLM2-Bench
VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues
☆41Updated 3 months ago
GAIR-NLP / thinking-with-generated-images
Doodling our way to AGI ✏️ 🖼️ 🧠
☆94Updated 3 months ago
HKUST-LongGroup / Awesome-MLLM-Benchmarks
☆138Updated 6 months ago
yu-rp / apiprompting
[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models
☆102Updated 10 months ago
Osilly / Awesome-Interleaving-Reasoning
Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
☆136Updated last month
PKU-YuanGroup / AsFT
Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".
☆25Updated last month
Xiaohao-Liu / ModalBed
Towards Modality Generalization: A Benchmark and Prospective Analysis
☆25Updated 3 months ago
itsqyh / Awesome-LMMs-Mechanistic-Interpretability
A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…
☆123Updated last month
huaishengzhu / DSPO
☆28Updated 3 months ago