npurson / fid-metricsLinks
A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.
☆36Updated 5 months ago
Alternatives and similar repositories for fid-metrics
Users that are interested in fid-metrics are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆131Updated last year
- Official Implementation of VideoDPO☆146Updated 5 months ago
- [CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners☆150Updated last year
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆88Updated 6 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆311Updated last month
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization☆44Updated last month
- FACM: Flow-Anchored Consistency Models☆124Updated 2 months ago
- [ECCV 2024 Oral] Audio-Synchronized Visual Animation☆56Updated last year
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆284Updated 10 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆146Updated 8 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆96Updated last year
- CCEdit: Creative and Controllable Video Editing via Diffusion Models☆114Updated last year
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆47Updated 3 months ago
- Comparison between Frechet Video Distance implementation from StyleGAN-V and the original paper☆119Updated 9 months ago
- Training-Free Condition-Guided Text-to-Video Generation☆60Updated last week
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆84Updated 11 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆145Updated last week
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models☆75Updated last year
- This is the official implementation for ControlVAR.☆122Updated 10 months ago
- Official implementation of "HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment"☆82Updated 6 months ago
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset☆91Updated last week
- ☆76Updated 7 months ago
- ☆37Updated 11 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…☆186Updated 2 years ago
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models☆84Updated last month
- The official SpeakerVid-5M data curation code.☆48Updated 3 months ago
- Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos☆72Updated last year
- [Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions☆33Updated 8 months ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆157Updated 6 months ago
- Text-conditioned image-to-video generation based on diffusion models.☆55Updated last year