🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic evaluations of text-to-image models and supports customization with user-defined metrics, datasets, and visualizations.
☆90Feb 5, 2026Updated last month
Alternatives and similar repositories for EvalGIM
Users that are interested in EvalGIM are comparing it to the libraries listed below
Sorting:
- Official implementation of SimFlow☆27Dec 16, 2025Updated 2 months ago
- Code for ExploreTom☆91Jun 25, 2025Updated 8 months ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 5 months ago
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Dec 12, 2025Updated 2 months ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 5 months ago
- Code for ICLR 2024 paper "Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection"☆17Apr 20, 2024Updated last year
- Host CIFAR-10.2 Data Set☆13Sep 22, 2021Updated 4 years ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆120Jan 10, 2026Updated last month
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆20Oct 17, 2024Updated last year
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'.☆15Feb 12, 2024Updated 2 years ago
- ☆43May 30, 2025Updated 9 months ago
- [ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments☆20Aug 19, 2025Updated 6 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Jul 30, 2024Updated last year
- A suite of image and video neural tokenizers☆1,711Feb 11, 2025Updated last year
- CIFAR-10-Warehouse: Towards Broad and More Realistic Testbeds in Model Generalization Analysis☆18Jul 15, 2024Updated last year
- ☆22Feb 12, 2025Updated last year
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆53May 8, 2025Updated 9 months ago
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Mar 16, 2025Updated 11 months ago
- ☆29Mar 30, 2025Updated 11 months ago
- ☆28Mar 4, 2025Updated last year
- ICML2025☆63Aug 28, 2025Updated 6 months ago
- Official PyTorch implementation of the paper "Generating Novel Scene Compositions from Single Images and Videos"☆44Sep 22, 2022Updated 3 years ago
- Code, Data and Red Teaming for ZeroBench☆54Dec 23, 2025Updated 2 months ago
- ☆21Oct 10, 2024Updated last year
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆22Dec 4, 2024Updated last year
- This repository includes various baseline techniques for label-free model evaluation task for the VDU2023 competition.☆19Mar 8, 2023Updated 2 years ago
- Official PyTorch implementation of FlowMo.☆114Apr 7, 2025Updated 10 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆166Jan 31, 2025Updated last year
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated last year
- NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024☆1,815Nov 27, 2025Updated 3 months ago
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders☆212Feb 13, 2026Updated 3 weeks ago
- Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Fee…☆27Nov 11, 2023Updated 2 years ago
- ☆29Nov 9, 2025Updated 3 months ago
- [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization☆164Sep 15, 2025Updated 5 months ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆104Dec 9, 2024Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆32Mar 26, 2025Updated 11 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆175Feb 24, 2026Updated last week
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Jul 25, 2023Updated 2 years ago