sayakpaul / cmmd-pytorch
PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.
☆159 · Apr 5, 2024 · Updated last year
Alternatives and similar repositories for cmmd-pytorch
Users who are interested in cmmd-pytorch are comparing it to the libraries listed below.
- PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022] ☆1,138 · Aug 2, 2025 · Updated 6 months ago
- LCM Full Cycle Trainer for Ostris - Ai Toolkit ☆16 · Aug 20, 2024 · Updated last year
- ☆474 · Jun 20, 2025 · Updated 7 months ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models ☆556 · Apr 6, 2024 · Updated last year
- PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurIPS 2024. ☆34 · Sep 26, 2024 · Updated last year
- Official implementation for "CONVIQT: Contrastive Video Quality Estimator" ☆23 · Jun 14, 2022 · Updated 3 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023) ☆762 · Jan 26, 2024 · Updated 2 years ago
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024) ☆534 · Sep 8, 2025 · Updated 5 months ago
- Synthetic Face Recognition ☆19 · Oct 30, 2023 · Updated 2 years ago
- ☆182 · Oct 28, 2024 · Updated last year
- Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning" ☆384 · Jan 24, 2024 · Updated 2 years ago
- Official implementation of Inductive Moment Matching ☆572 · Jul 11, 2025 · Updated 7 months ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis ☆645 · May 24, 2024 · Updated last year
- ***ECCV 2024*** AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition ☆26 · Sep 22, 2024 · Updated last year
- EDM2 and Autoguidance -- Official PyTorch implementation ☆819 · Dec 9, 2024 · Updated last year
- diffusers with search engine ☆12 · Jan 13, 2026 · Updated last month
- Implementation for NATv2. ☆23 · Feb 20, 2021 · Updated 4 years ago
- Albumentations Data Augmentation Plugin for FiftyOne! ☆14 · Aug 22, 2024 · Updated last year
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models" ☆13 · Aug 6, 2024 · Updated last year
- Various palmprint feature extraction techniques: CompCode, Local Tetra Patterns, RLOC ☆10 · Sep 4, 2019 · Updated 6 years ago
- Cricket analytics for humans 🏏 ☆12 · Sep 4, 2022 · Updated 3 years ago
- Official Implementation for "ReMOVE: A Reference-free Metric for Object Erasure" ☆24 · Apr 30, 2024 · Updated last year
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor ☆521 · Apr 2, 2024 · Updated last year
- Train high-quality text-to-image diffusion models in a data & compute efficient manner ☆515 · Mar 27, 2025 · Updated 10 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆3,279 · Oct 31, 2024 · Updated last year
- [CVPR2025] RORem: Training a Robust Object Remover with Human-in-the-Loop ☆64 · Sep 9, 2025 · Updated 5 months ago
- utilities to facilitate working with codebases that don't ascribe to normal package management paradigms, e.g. ML research code that can … ☆13 · Nov 26, 2022 · Updated 3 years ago
- Caffe/Neon prototxt training file for our Neurocomputing2017 work: Fuzzy Quantitative Deep Compression Network ☆12 · May 30, 2018 · Updated 7 years ago
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N… ☆50 · Oct 23, 2023 · Updated 2 years ago
- Official repo for "GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment" ☆14 · Mar 10, 2024 · Updated last year
- Official implementation of Decoupled MeanFlow ☆34 · Oct 28, 2025 · Updated 3 months ago
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation ☆506 · Oct 31, 2024 · Updated last year
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆158 · Oct 25, 2023 · Updated 2 years ago
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024] ☆34 · Dec 12, 2023 · Updated 2 years ago
- Official Code for MIMETIC^2 ☆13 · Nov 19, 2024 · Updated last year
- Run SOTA Vision-Language Model Florence-2 on your data! ☆15 · Mar 27, 2025 · Updated 10 months ago
- Pretraining summarization models using a corpus of nonsense ☆13 · Sep 28, 2021 · Updated 4 years ago
- A collection of various custom nodes for ComfyUI (Work in progress) ☆14 · Jun 9, 2025 · Updated 8 months ago
- ☆16 · Jan 28, 2024 · Updated 2 years ago