rongyaofang / prism-bench
This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark"
☆121Jan 29, 2026Updated 2 weeks ago
Alternatives and similar repositories for prism-bench
Users who are interested in prism-bench are comparing it to the repositories listed below.
- ☆47Apr 20, 2025Updated 9 months ago
- [AAAI 2026] GenMAC for Compositional Text-to-Video Generation☆32Jan 10, 2026Updated last month
- [SIGGRAPH Asia 2025] Official GitHub repo of SeqTex, an end-to-end 3D texture generation method using video diffusion priors.☆38Dec 12, 2025Updated 2 months ago
- T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation☆36Sep 16, 2025Updated 5 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆184Mar 20, 2025Updated 10 months ago
- Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark.☆52Jan 7, 2026Updated last month
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆198Jan 7, 2026Updated last month
- [NeurIPS 2024 D&B Track] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆73Dec 27, 2024Updated last year
- A Comprehensive Dataset for Advanced Image Generation and Editing☆31Oct 2, 2025Updated 4 months ago
- [NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…☆107Updated this week
- ☆41Jan 4, 2026Updated last month
- [AAAI 2026] X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆93Nov 21, 2025Updated 2 months ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Mar 21, 2024Updated last year
- [ICLR26] GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆103Jan 27, 2026Updated 2 weeks ago
- This project is the official implementation of "DreamOmni3: Scribble-based Editing and Generation"☆37Dec 30, 2025Updated last month
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?☆86Jul 13, 2025Updated 7 months ago
- ☆97Jun 23, 2025Updated 7 months ago
- Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes".☆90Jan 14, 2026Updated last month
- [ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potenti…☆361Feb 5, 2026Updated last week
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆420Aug 26, 2025Updated 5 months ago
- Are Video Models Ready as Zero-shot Reasoners?☆84Nov 24, 2025Updated 2 months ago
- ☆10Jan 23, 2025Updated last year
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆131Nov 27, 2025Updated 2 months ago
- ☆28Apr 8, 2025Updated 10 months ago
- Official Code Release of NeurIPS 2025 Paper: HoloScene: Simulation‑Ready Interactive 3D Worlds from a Single Video☆89Oct 8, 2025Updated 4 months ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆214Sep 27, 2025Updated 4 months ago
- ☆81Jun 23, 2025Updated 7 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 3 months ago
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆308Sep 28, 2025Updated 4 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 6 months ago
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning☆35Jul 15, 2025Updated 7 months ago
- [ICLR2026] The official code of "Weak-to-Strong Diffusion with Reflection".☆55Jan 28, 2026Updated 2 weeks ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆28Nov 4, 2025Updated 3 months ago
- Official Implementation for "SiLVR: A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 3 weeks ago
- ☆13Jul 10, 2024Updated last year
- Generating high-quality image-pairs and training InstructPix2Pix with SDXL☆14Apr 9, 2024Updated last year
- ☆12Dec 4, 2024Updated last year
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"☆15Aug 27, 2025Updated 5 months ago