openai / dalle3-eval-samples
Text-to-image samples collected for the evaluation of DALL-E 3 in the whitepaper.
☆63Updated last year
Alternatives and similar repositories for dalle3-eval-samples
Users who are interested in dalle3-eval-samples are comparing it to the repositories listed below
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)☆168Updated last month
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆87Updated 5 months ago
- TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering☆160Updated last year
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆284Updated 6 months ago
- ☆85Updated last year
- GenEval: An object-focused framework for evaluating text-to-image alignment☆270Updated 2 months ago
- ☆171Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆132Updated 2 years ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆100Updated last year
- [ICLR 2025] Official PyTorch implementation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆101Updated last year
- An implementation of the Prompt-to-Prompt paper for the SDXL architecture☆110Updated 11 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆130Updated 10 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆75Updated 11 months ago
- [ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"☆130Updated 11 months ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆146Updated 7 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆74Updated 4 months ago
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".☆141Updated last year
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆109Updated 2 months ago
- Matryoshka Multimodal Models☆106Updated 3 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆54Updated 9 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆99Updated 10 months ago
- ☆48Updated last year
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆237Updated last month
- LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation☆130Updated last year
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"☆122Updated 6 months ago
- ☆64Updated 9 months ago
- Implementation of the premier Text to Video model from OpenAI☆57Updated 6 months ago
- [NeurIPS 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation☆255Updated last month
- Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.☆82Updated last year
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆284Updated last year