HITsz-TMG / Agentic-CIGEval
Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation".
☆17Updated last week
Alternatives and similar repositories for Agentic-CIGEval:
Users that are interested in Agentic-CIGEval are comparing it to the libraries listed below
- Official Repository of Personalized Visual Instruct Tuning☆28Updated last month
- ☆40Updated 9 months ago
- ☆17Updated 6 months ago
- ☆22Updated 10 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆31Updated 2 months ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆15Updated last month
- Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing☆27Updated 4 months ago
- ☆22Updated 3 weeks ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Updated last year
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆27Updated last month
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆47Updated 6 months ago
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆54Updated 2 months ago
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆33Updated 2 months ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆45Updated 5 months ago
- Code for FineRewards☆20Updated last year
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆37Updated last month
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆44Updated last year
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆69Updated 7 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆70Updated 9 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆42Updated 2 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆52Updated 8 months ago
- Unifying Visual Understanding and Generation with Dual Visual Vocabularies 🌈☆39Updated this week
- Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".☆52Updated last week
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆35Updated this week
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆19Updated 4 months ago
- Official repository of IDEA-Bench☆34Updated 2 months ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆35Updated last month
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆30Updated 4 months ago
- Official implement of MIA-DPO☆55Updated 3 months ago
- Video Diffusion State Space Models☆19Updated last year