djghosh13 / geneval
GenEval: An object-focused framework for evaluating text-to-image alignment
☆85Updated last month
Related projects: ⓘ
- [Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation☆190Updated 3 weeks ago
- Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step☆133Updated 2 months ago
- Official code for 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆122Updated 4 months ago
- ☆168Updated 2 months ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆74Updated 4 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆96Updated 2 months ago
- ☆147Updated last year
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆69Updated 5 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆107Updated 2 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆75Updated 2 months ago
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)☆146Updated 5 months ago
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".☆117Updated 8 months ago
- CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆210Updated 2 weeks ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆93Updated 4 months ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆228Updated 6 months ago
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"☆160Updated 5 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)☆144Updated 2 weeks ago
- ☆99Updated 6 months ago
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆105Updated last year
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance☆84Updated last month
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆118Updated 2 weeks ago
- RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆103Updated 3 months ago
- [NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain☆142Updated 8 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆90Updated 2 months ago
- Code repository for T2V-Turbo☆166Updated 2 months ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".☆279Updated 9 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation"☆37Updated this week
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆76Updated 5 months ago
- ☆89Updated 4 months ago
- ☆92Updated 2 months ago