npurson / fid-metrics
A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.
☆22Updated 2 weeks ago
Alternatives and similar repositories for fid-metrics:
Users that are interested in fid-metrics are comparing it to the libraries listed below
- Towards training VQ-VAE models robustly!☆57Updated 2 months ago
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆58Updated 3 weeks ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆77Updated 4 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆103Updated 5 months ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆94Updated 2 months ago
- ☆164Updated last month
- This is the official implementation for ControlVAR.☆99Updated 3 months ago
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models☆69Updated 6 months ago
- The official repository of DreamMover☆30Updated 6 months ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆38Updated 3 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆66Updated 5 months ago
- ☆77Updated 9 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆95Updated last year
- ☆38Updated last year
- The official PyTorch implementation of Fast Diffusion Model☆95Updated last year
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated 6 months ago
- Official code for the paper 'DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space'☆21Updated 2 months ago
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆49Updated 3 months ago
- Official code for Accelerating Diffusion Sampling with Optimized Time Steps (CVPR 2024)☆29Updated last year
- [CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners☆140Updated 8 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆137Updated last month
- ☆144Updated 3 months ago
- Sora Generates Videos with Stunning Geometrical Consistency☆49Updated 11 months ago
- Training-Free Condition-Guided Text-to-Video Generation☆62Updated last year