j-min / DSGLinks

Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)

☆91

Alternatives and similar repositories for DSG

Users that are interested in DSG are comparing it to the libraries listed below

Sorting:

google-research-datasets / richhf-18k
RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…
☆138Updated last year
Yushi-Hu / tifa
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
☆170Updated last year
j-min / VPGen
Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆56Updated 2 years ago
UCSC-VLAA / HQ-Edit
[ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
☆103Updated last year
salesforce / HIVE
☆111Updated 6 months ago
xichenpan / Kosmos-G
Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models
☆73Updated last year
weijiawu / ParaDiffusion
[IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model
☆105Updated 4 months ago
TIGER-AI-Lab / VIEScore
Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…
☆50Updated 8 months ago
TIGER-AI-Lab / ImagenHub
A one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]
☆170Updated 3 months ago
hananshafi / llmblueprint
[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"
☆80Updated last year
nanlliu / Unsupervised-Compositional-Concepts-Discovery
[ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models
☆84Updated last year
mlpc-ucsd / TokenCompose
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
☆126Updated 7 months ago
UCSC-VLAA / Recap-DataComp-1B
[ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"
☆138Updated last year
linzhiqiu / CLIP-FlanT5
Training code for CLIP-FlanT5
☆27Updated last year
RoyiRa / Linguistic-Binding-in-Diffusion-Models
☆81Updated 8 months ago
YujieLu10 / LLMScore
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
☆132Updated last year
eclipse-t2i / eclipse-inference
[CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"
☆64Updated last year
showlab / T2VScore
T2VScore: Towards A Better Metric for Text-to-Video Generation
☆80Updated last year
navervision / CompoDiff
Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)
☆85Updated 6 months ago
jacklishufan / diffusion-kto
The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility
☆57Updated 6 months ago
bahjat-kawar / time-diffusion
Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"
☆86Updated 2 years ago
fusiming3 / MARS
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
☆85Updated last year
JourneyDB / JourneyDB
☆174Updated 2 years ago
shunk031 / training-free-structured-diffusion-guidance
🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…
☆120Updated 2 years ago
tgxs002 / align_sd
Better Aligning Text-to-Image Models with Human Preference. ICCV 2023
☆287Updated 2 years ago
TIGER-AI-Lab / VideoScore
official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]
☆94Updated 5 months ago
aszala / VPEval
VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆45Updated last year
ruocwang / dpo-diffusion
[ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google
☆59Updated 11 months ago
Shentao-YANG / Dense_Reward_T2I
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
☆39Updated last year
huggingface / amused
☆86Updated last year