google-research-datasets / richhf-18k
The RichHF-18K dataset contains the rich human feedback labels we collected for our CVPR'24 paper (https://arxiv.org/pdf/2312.10240), along with the file names of the associated labeled images (no URLs or images are included in this dataset).
☆96Updated 2 months ago
Related projects:
- ☆89Updated 4 months ago
- [NeurIPS 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation☆190Updated 3 weeks ago
- Official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation"☆37Updated this week
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆75Updated 11 months ago
- TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering☆132Updated 4 months ago
- ☆72Updated 5 months ago
- GenEval: An object-focused framework for evaluating text-to-image alignment☆85Updated last month
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆74Updated 4 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆73Updated last month
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆75Updated 2 months ago
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆105Updated last year
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆43Updated 3 months ago
- Implementation of "DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning"☆75Updated last year
- Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023)☆85Updated 4 months ago
- ☆109Updated 2 months ago
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training☆148Updated 2 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆91Updated 2 months ago
- ☆168Updated 2 months ago
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…☆47Updated 6 months ago
- Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models"☆156Updated 11 months ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆34Updated 2 months ago
- Training code for CLIP-FlanT5☆15Updated last month
- Ablating Concepts in Text-to-Image Diffusion Models (ICCV 2023)☆144Updated 8 months ago
- Densely Captioned Images (DCI) dataset repository.☆155Updated 2 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆68Updated 10 months ago
- ICCV2023-Diffusion-Papers☆110Updated last year
- The benchmark of SOTA text-to-image diffusion models with a new benchmarking strategy based on MiniGPT-4, namely X-IQE.☆105Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆76Updated 5 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆107Updated 2 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆118Updated 2 weeks ago