Wu-Zongyu / CharmBenchLinks
A preview-version of one novel multimodal reasoning benchmark CharmBench.
☆23Updated 2 weeks ago
Alternatives and similar repositories for CharmBench
Users that are interested in CharmBench are comparing it to the libraries listed below
Sorting:
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆23Updated last month
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆80Updated 2 weeks ago
- ☆49Updated 9 months ago
- 🔥 【Meta Awesome List】: AI/ML Research Hub - Solving the "Chasing Hot Topics" Problem for AI Researchers. 🤖 Agent-driven intelligence au…☆42Updated this week
- A tiny paper rating web☆39Updated 5 months ago
- Official implementation of MC-LLaVA.☆139Updated 2 weeks ago
- ☆39Updated 5 months ago
- 关于LLM和Multimodal LLM的paper list☆43Updated last week
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆121Updated 3 weeks ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆281Updated 3 weeks ago
- A Collection of Papers on Diffusion Language Models☆119Updated last week
- ☆20Updated 3 months ago
- ☆18Updated 10 months ago
- Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation☆28Updated last month
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆42Updated 4 months ago
- OOD Generalization相关文章的阅读笔记☆31Updated 8 months ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆153Updated 5 months ago
- Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆53Updated 2 months ago
- ☆67Updated last month
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.☆27Updated 3 weeks ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆144Updated 3 weeks ago
- VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues☆41Updated 3 months ago
- Doodling our way to AGI ✏️ 🖼️ 🧠☆94Updated 3 months ago
- ☆138Updated 6 months ago
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models☆102Updated 10 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆136Updated last month
- Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".☆25Updated last month
- Towards Modality Generalization: A Benchmark and Prospective Analysis☆25Updated 3 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆123Updated last month
- ☆28Updated 3 months ago