djghosh13/geneval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/djghosh13/geneval)

djghosh13 / geneval

GenEval: An object-focused framework for evaluating text-to-image alignment

☆472

Alternatives and similar repositories for geneval

Users that are interested in geneval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PKU-YuanGroup / WISE
View on GitHub
[ICML 2026🔥] WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
☆212Jun 26, 2026Updated last month
Karine-Huang / T2I-CompBench
View on GitHub
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
☆345May 7, 2026Updated 2 months ago
MizzenAI / HPSv3
View on GitHub
Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)
☆328Dec 5, 2025Updated 7 months ago
yifan123 / flow_grpo
View on GitHub
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
☆2,434May 7, 2026Updated 2 months ago
OneIG-Bench / OneIG-Benchmark
View on GitHub
[NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…
☆120Feb 10, 2026Updated 5 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
zai-org / VisionReward
View on GitHub
[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
☆422Mar 26, 2025Updated last year
tgxs002 / HPSv2
View on GitHub
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
☆677May 24, 2024Updated 2 years ago
zai-org / ImageReward
View on GitHub
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
☆1,694Oct 29, 2025Updated 8 months ago
facebookresearch / GenEval2
View on GitHub
Evaluation codes and data for GenEval2
☆80Jan 8, 2026Updated 6 months ago
PKU-YuanGroup / ImgEdit
View on GitHub
[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark
☆329Nov 5, 2025Updated 8 months ago
wusize / Harmon
View on GitHub
[ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
☆192May 21, 2025Updated last year
JiuhaiChen / BLIP3o
View on GitHub
Official implementation of BLIP3o-Series
☆1,663Nov 29, 2025Updated 7 months ago
TencentQQGYLab / ELLA
View on GitHub
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
☆1,285Jul 17, 2024Updated 2 years ago
yuvalkirstain / PickScore
View on GitHub
☆601Dec 21, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
XueZeyue / DanceGRPO
View on GitHub
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
☆1,641Oct 16, 2025Updated 9 months ago
sihyun-yu / REPA
View on GitHub
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,680Mar 16, 2025Updated last year
Vchitect / VBench
View on GitHub
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,709Mar 23, 2026Updated 4 months ago
ByteDance-Seed / Bagel
View on GitHub
Open-source unified multimodal model
☆6,123May 4, 2026Updated 2 months ago
tianweiy / DMD2
View on GitHub
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
☆1,412Mar 5, 2025Updated last year
NVlabs / DiffusionNFT
View on GitHub
[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process
☆987Feb 10, 2026Updated 5 months ago
ATH-MaaS / Awesome-Unified-Multimodal-Models
View on GitHub
Awesome Unified Multimodal Models
☆1,306Mar 24, 2026Updated 4 months ago
FoundationVision / LlamaGen
View on GitHub
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960Aug 15, 2024Updated last year
yifan123 / reward-server
View on GitHub
☆73Jul 10, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
SalesforceAIResearch / DiffusionDPO
View on GitHub
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
☆706Jun 2, 2026Updated last month
showlab / Show-o
View on GitHub
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
☆1,964Jan 8, 2026Updated 6 months ago
FoundationVision / Infinity
View on GitHub
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
☆1,579Apr 16, 2026Updated 3 months ago
facebookresearch / metaquery
View on GitHub
Official Implementation of Paper Transfer between Modalities with MetaQueries
☆325Oct 12, 2025Updated 9 months ago
linzhiqiu / t2v_metrics
View on GitHub
Evaluating text-to-image/video/3D models with VQAScore
☆598Jun 5, 2026Updated last month
CodeGoat24 / UnifiedReward
View on GitHub
Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex
☆796Jun 18, 2026Updated last month
KlingAIResearch / VideoAlign
View on GitHub
[NeurIPS 2025] Improving Video Generation with Human Feedback
☆488Sep 24, 2025Updated 10 months ago
ZiyuGuo99 / Image-Generation-CoT
View on GitHub
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation
☆865Mar 19, 2026Updated 4 months ago
FoundationVision / UniTok
View on GitHub
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆529Nov 14, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
X-Omni-Team / X-Omni
View on GitHub
Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).
☆426Aug 26, 2025Updated 11 months ago
PixArt-alpha / PixArt-alpha
View on GitHub
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,300Oct 31, 2024Updated last year
hustvl / LightningDiT
View on GitHub
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,510Dec 16, 2025Updated 7 months ago
wusize / OpenUni
View on GitHub
☆189Jun 27, 2025Updated last year
PKU-YuanGroup / UniWorld
View on GitHub
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
☆886Dec 23, 2025Updated 7 months ago
LTH14 / mar
View on GitHub
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
☆1,943Feb 20, 2026Updated 5 months ago
PixArt-alpha / PixArt-sigma
View on GitHub
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
☆1,933Oct 31, 2024Updated last year