Yaofang-Liu / FVDMView external linksLinks
Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'
☆34Jan 2, 2026Updated last month
Alternatives and similar repositories for FVDM
Users that are interested in FVDM are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 7 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- ☆31Sep 1, 2025Updated 5 months ago
- Reward Guided Latent Consistency Distillation☆26Oct 9, 2024Updated last year
- [NeurIPS 2025] Reward-Instruct: A Reward-Centric Approach to Fast Photo-Realistic Image Generation☆34Oct 24, 2025Updated 3 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆35Mar 12, 2024Updated last year
- ☆30May 9, 2024Updated last year
- Minimal PyTorch implementation of TP, SP, FSDP and sharded-EMA☆31Nov 27, 2025Updated 2 months ago
- ☆15Mar 30, 2025Updated 10 months ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆18May 2, 2025Updated 9 months ago
- ☆16Oct 4, 2024Updated last year
- ☆19Apr 28, 2023Updated 2 years ago
- Pusa: Thousands Timesteps Video Diffusion Model☆671Updated this week
- ☆17Feb 20, 2025Updated 11 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆307Mar 12, 2025Updated 11 months ago
- An innovative method designed to augment the capabilities of existing video diffusion models☆22May 10, 2024Updated last year
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆20Jan 26, 2025Updated last year
- Image captioning with weight pruning in PyTorch☆22Jan 14, 2022Updated 4 years ago
- (TPAMI'2024) ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation☆22Aug 8, 2024Updated last year
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆23Jul 30, 2025Updated 6 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆90Oct 12, 2024Updated last year
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆64Jul 2, 2025Updated 7 months ago
- ☆44Sep 1, 2025Updated 5 months ago
- ☆27Apr 9, 2023Updated 2 years ago
- [CVPR 2025] GPS as a Control Signal for Image Generation☆25Mar 18, 2025Updated 10 months ago
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with…☆161Aug 26, 2025Updated 5 months ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆22Jul 21, 2024Updated last year
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆25Dec 7, 2023Updated 2 years ago
- [IJCV 2026] HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts☆26Feb 28, 2025Updated 11 months ago
- Code for full fintuing Mochi model with FSDP (and CP)☆30Jul 15, 2025Updated 7 months ago
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…☆57Mar 4, 2024Updated last year
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69May 18, 2025Updated 8 months ago
- Code of the paper "FreePCA:Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…☆28Aug 26, 2025Updated 5 months ago
- [ICML2025] LoRA fine-tune directly on the quantized models.☆39Nov 25, 2024Updated last year
- Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"☆352Dec 8, 2023Updated 2 years ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆77Oct 15, 2024Updated last year
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆73Oct 21, 2025Updated 3 months ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆947Nov 13, 2024Updated last year
- ☆66Jun 4, 2024Updated last year