A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.
☆45May 29, 2025Updated 9 months ago
Alternatives and similar repositories for fid-metrics
Users that are interested in fid-metrics are comparing it to the libraries listed below
Sorting:
- ☆13Dec 22, 2023Updated 2 years ago
- You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.☆556Jan 17, 2026Updated last month
- STOI loss functions in PyTorch (mirror of https://github.com/mpariente/pytorch_stoi)☆15Aug 6, 2020Updated 5 years ago
- ☆20Jul 13, 2022Updated 3 years ago
- The open source implementation of the cross attention mechanism from the paper: "JOINTLY TRAINING LARGE AUTOREGRESSIVE MULTIMODAL MODELS"☆37Mar 11, 2024Updated last year
- diffusion model baesd video-virtual-try-on☆26Feb 20, 2024Updated 2 years ago
- Enhancment of Audio Quality (Bit-Depth and Sampling-Rate) using Deep Learning.☆33Mar 19, 2020Updated 5 years ago
- Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos☆81Jul 26, 2024Updated last year
- experiments about AudioSet☆43Jul 22, 2023Updated 2 years ago
- SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems☆10Apr 11, 2025Updated 10 months ago
- Contrastive Language-Audio Pretraining☆87Mar 6, 2022Updated 4 years ago
- A QGIS plugin for mineral prospectivity mapping☆17Jul 3, 2025Updated 8 months ago
- Towards Photorealistic 4D Scene Generation via Video Diffusion Models☆20Jun 12, 2024Updated last year
- Whisper finetuning☆16Apr 9, 2025Updated 11 months ago
- Course: Algorithms for Data Science☆16Feb 7, 2020Updated 6 years ago
- ☆13May 30, 2025Updated 9 months ago
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control☆99May 13, 2025Updated 9 months ago
- ☆41Dec 15, 2023Updated 2 years ago
- Filter Banks, Fast Python Implementation☆42Jul 9, 2022Updated 3 years ago
- ☆45Jun 11, 2024Updated last year
- the codes are for the series CNN baselines tested in our wildfile flame detection dataset.☆10Nov 21, 2022Updated 3 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago
- Implementation for NATv2.☆23Feb 20, 2021Updated 5 years ago
- Image and video processing toolbox☆10Jun 12, 2020Updated 5 years ago
- Python bindings for NVIDIA CUDA APIs.☆13Mar 2, 2024Updated 2 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- ☆10Apr 8, 2024Updated last year
- 【ICME2025 Oral】Offical Pytorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"☆11Mar 21, 2025Updated 11 months ago
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- LaTeX tutorial using Texmaker. This repository follows [Michelle Krummel's Tutorial](https://www.youtube.com/watch?v=SoDv0qhyysQ&list=PL1…☆11Jun 14, 2018Updated 7 years ago
- ☆13Mar 10, 2024Updated last year
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆17Jun 12, 2024Updated last year
- Official implementation and project page of the CVPR'24 paper "VMINer: Versatile Multi-view Inverse Rendering with Near- and Far-field Li…☆14Aug 6, 2024Updated last year
- Determines the ethnicity based on your last name☆10Aug 17, 2014Updated 11 years ago
- ☆10Jan 5, 2021Updated 5 years ago
- ☆10Nov 18, 2024Updated last year
- A Multimodal Generative World Model for Autonomous Driving with Geometric Representations☆13Aug 27, 2025Updated 6 months ago
- This repository follows Luke Smith's Latex Resume Tutorial.☆10Jun 21, 2019Updated 6 years ago
- Facial Alignment for Anime Styled Faces☆10Mar 26, 2021Updated 4 years ago