[NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models across multiple dimensions, including subject-element alignment, text rendering precision, reasoning-generated content, stylization, and diversity.
☆107 Feb 10, 2026 Updated 2 weeks ago
Alternatives and similar repositories for OneIG-Benchmark
Users who are interested in OneIG-Benchmark are comparing it to the repositories listed below.
- [ICLR 2026] The official implementation of "RegionE: Adaptive Region-Aware Generation for Efficient Image Editing" ☆78 Feb 3, 2026 Updated 3 weeks ago
- [NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models" ☆40 Oct 19, 2025 Updated 4 months ago
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe… ☆121 Jan 29, 2026 Updated last month
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph. ☆31 Aug 7, 2025 Updated 6 months ago
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?". ☆161 Nov 14, 2025 Updated 3 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval ☆21 Jun 23, 2025 Updated 8 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence. ☆28 Dec 30, 2025 Updated 2 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models ☆39 Jan 5, 2026 Updated last month
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?" ☆31 Dec 23, 2024 Updated last year
- [IJCV 2026] HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts ☆26 Feb 28, 2025 Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows ☆19 Nov 4, 2025 Updated 3 months ago
- ☆27 Updated this week
- [CVPR 2026] ViStoryBench: AI Story Visualization Benchmark ☆133 Feb 21, 2026 Updated last week
- Doodling our way to AGI ✏️ 🖼️ 🧠 ☆121 May 29, 2025 Updated 9 months ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision ☆210 Jan 27, 2026 Updated last month
- ☆34 Jan 25, 2026 Updated last month
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach ☆14 Apr 2, 2025 Updated 10 months ago
- ComfyUI version of WithAnyone ☆23 Dec 18, 2025 Updated 2 months ago
- Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex ☆706 Feb 10, 2026 Updated 2 weeks ago
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation ☆330 Dec 24, 2025 Updated 2 months ago
- ☆12 Feb 13, 2025 Updated last year
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding ☆34 Jan 16, 2026 Updated last month
- The official repo for the DanQing dataset. ☆29 Jan 16, 2026 Updated last month
- More reliable Video Understanding Evaluation ☆14 Sep 23, 2025 Updated 5 months ago
- Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content" ☆40 Jun 9, 2025 Updated 8 months ago
- [ACL 2025 Main] Open-source toolkit for automatic evaluation of text-to-image generation task, including training & test datasets and a d… ☆16 Jul 5, 2025 Updated 7 months ago
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation ☆106 Oct 25, 2025 Updated 4 months ago
- Official implementation of paper: "SwinTExCo: Exemplar-based Video Colorization using Swin Transformer" ☆13 Oct 6, 2024 Updated last year
- UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation ☆22 May 16, 2025 Updated 9 months ago
- ☆14 Jun 25, 2025 Updated 8 months ago
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs ☆38 Feb 19, 2026 Updated last week
- PICABench: How Far Are We from Physically Realistic Image Editing? ☆35 Nov 5, 2025 Updated 3 months ago
- MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities ☆18 May 27, 2025 Updated 9 months ago
- Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning ☆232 Feb 10, 2026 Updated 2 weeks ago
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback ☆234 Jan 24, 2026 Updated last month
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System. ☆19 Oct 14, 2024 Updated last year
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058). ☆420 Aug 26, 2025 Updated 6 months ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models ☆43 Apr 10, 2025 Updated 10 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google ☆63 Aug 14, 2024 Updated last year