HITsz-TMG / Agentic-CIGEval
Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation".
☆21Updated last month
Alternatives and similar repositories for Agentic-CIGEval
Users that are interested in Agentic-CIGEval are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆29Updated last week
- Official Repository of Personalized Visual Instruct Tuning☆28Updated 2 months ago
- ☆40Updated 10 months ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆15Updated 2 months ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆17Updated 7 months ago
- ☆23Updated 10 months ago
- ☆17Updated 6 months ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆30Updated 5 months ago
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆57Updated 3 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆31Updated 3 months ago
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆18Updated last year
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆44Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆47Updated 7 months ago
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆18Updated last month
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 10 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated last year
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆56Updated last year
- Code for FineRewards☆20Updated last year
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆35Updated 11 months ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆45Updated 6 months ago
- ☆23Updated last month
- ☆17Updated 9 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆34Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 9 months ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆20Updated last year
- ☆49Updated 4 months ago
- ☆13Updated 7 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆59Updated 2 months ago
- ☆26Updated 2 months ago