facebookresearch / EvalGIM
🦾 EvalGIM (pronounced "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic evaluations of text-to-image models and supports customization with user-defined metrics, datasets, and visualizations.
★61 · Updated 3 weeks ago
Alternatives and similar repositories for EvalGIM:
Users who are interested in EvalGIM are comparing it to the libraries listed below:
- Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image …" · ★62 · Updated last month
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or… · ★112 · Updated 6 months ago
- Official PyTorch implementation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching" · ★97 · Updated 10 months ago
- ★48 · Updated last year
- ElasticTok: Adaptive Tokenization for Image and Video · ★44 · Updated 2 months ago
- A Video Tokenizer Evaluation Dataset · ★86 · Updated this week
- Official implementation of the paper "The Hidden Language of Diffusion Models" · ★69 · Updated 11 months ago
- This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?" · ★126 · Updated 7 months ago
- Official code for "Paragraph-to-Image Generation with Information-Enriched Diffusion Model" · ★102 · Updated last month
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth? · ★94 · Updated 2 months ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024) · ★80 · Updated last month
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) · ★44 · Updated last year
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation · ★95 · Updated last week
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) · ★54 · Updated last year
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024) · ★161 · Updated 6 months ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models · ★78 · Updated last year
- ★45 · Updated 9 months ago
- ★82 · Updated last year
- ★67 · Updated 6 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" · ★61 · Updated 8 months ago
- This is a PyTorch-based reimplementation of CrossFlow, as proposed in "Flowing from Words to Pixels: A Framework for Cross-Modality Evolu…" · ★124 · Updated 2 weeks ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024) · ★154 · Updated 3 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data · ★33 · Updated 10 months ago
- ★31 · Updated 4 months ago
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs. · ★90 · Updated 9 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings" · ★118 · Updated 4 months ago
- TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering · ★141 · Updated 8 months ago
- Matryoshka Multimodal Models · ★90 · Updated last month
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective · ★57 · Updated 2 months ago
- Official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP 2024] · ★67 · Updated last month