zhangjiewu / awesome-t2i-eval
A curated list of papers and resources for text-to-image evaluation.
☆29 · Updated last year
Alternatives and similar repositories for awesome-t2i-eval
Users who are interested in awesome-t2i-eval are comparing it to the libraries listed below.
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ☆64 · Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance". ☆42 · Updated 2 years ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin… ☆30 · Updated 2 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆80 · Updated last year
- ☆21 · Updated 2 years ago
- MCPL: MULTI-CONCEPT PROMPT LEARNING ☆20 · Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data ☆34 · Updated last year
- ☆19 · Updated 2 years ago
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N… ☆50 · Updated last year
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models ☆21 · Updated 3 months ago
- Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition" ☆107 · Updated last year
- Official PyTorch implementation of "Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis… ☆44 · Updated last year
- Video Diffusion State Space Models ☆19 · Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models ☆46 · Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆27 · Updated last year
- ☆24 · Updated last year
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach' ☆23 · Updated 9 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ☆49 · Updated 9 months ago
- Implementation of MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path ☆69 · Updated 2 years ago
- Democratising RGBA Image Generation With No $$$ (AI4VA@ECCV24) ☆30 · Updated 10 months ago
- PyTorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion" ☆27 · Updated 8 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text ☆66 · Updated 10 months ago
- [ICLR 2025] Official PyTorch implementation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit… ☆103 · Updated last year
- ☆26 · Updated 4 months ago
- ☆39 · Updated last year
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners" ☆29 · Updated last year
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ☆45 · Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆42 · Updated 11 months ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation ☆31 · Updated 7 months ago
- ☆85 · Updated last year