npurson / fid-metrics
A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for fid-metrics
- Efficient synchronization from sparse cues☆28Updated 6 months ago
- ☆25Updated last year
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated last year
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆53Updated 2 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- official code for Diff-Instruct algorithm for one-step diffusion distillation☆46Updated 7 months ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆76Updated 4 months ago
- Official implementation of OSSGAN [CVPR 2022]☆22Updated 2 years ago
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆76Updated 11 months ago
- ☆31Updated 3 weeks ago
- ☆21Updated last year
- Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)☆56Updated 5 years ago
- [ECCV 2024 Oral] Audio-Synchronized Visual Animation☆34Updated 2 months ago
- The official code of WaveGAN: Frequency-aware GAN for High-Fidelity Few-shot Image Generation (ECCV2022)☆73Updated last year
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆27Updated 7 months ago
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆35Updated last year
- Unofficial implementation of Neural Analysis and Synthesis☆7Updated 2 years ago
- Official implemention for Diffusion Models Are Innate One-Step Generators☆20Updated 5 months ago
- [ICLR 2024] Official code for the paper 'Elucidating the Exposure Bias in Diffusion Models'☆39Updated 6 months ago
- Codebase and project page for EDMSound☆29Updated 11 months ago
- The project page repo for Neural Dubber.☆27Updated last year
- [ACM MM 2024] Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization☆12Updated 2 months ago
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation☆33Updated 2 months ago
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation☆13Updated 3 weeks ago
- ☆46Updated 4 months ago
- The official PyTorch implementation of Fast Diffusion Model☆91Updated last year
- official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transform…☆29Updated last year
- [CVPR 2023] GLeaD: Improving GANs with A Generator-Leading Task☆32Updated last year