jy0205 / Pyramid-Flow
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
☆2,862Updated 3 months ago
Alternatives and similar repositories for Pyramid-Flow:
Users that are interested in Pyramid-Flow are comparing it to the libraries listed below
- Official repository for LTX-Video☆3,189Updated 3 weeks ago
- The best OSS video generation models☆3,044Updated 2 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,086Updated 3 weeks ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆3,796Updated this week
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,166Updated last month
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,522Updated 3 months ago
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆3,816Updated last month
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,174Updated last week
- A minimal and universal controller for FLUX.1.☆1,328Updated 2 weeks ago
- A general fine-tuning kit geared toward diffusion models.☆2,151Updated this week
- Official repository of In-Context LoRA for Diffusion Transformers☆1,715Updated 3 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,810Updated 6 months ago
- Memory-optimized training library for diffusion models☆995Updated this week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,282Updated 6 months ago
- ☆2,272Updated 2 weeks ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,197Updated 4 months ago
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆1,884Updated 2 weeks ago
- ☆2,719Updated last week
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"☆1,467Updated 2 months ago
- Various AI scripts. Mostly Stable Diffusion stuff.☆4,376Updated this week
- ☆772Updated 4 months ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA☆1,547Updated 6 months ago
- ☆715Updated last month
- [CVPR 2025] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,237Updated last week
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,489Updated 3 weeks ago
- Dead simple FLUX LoRA training UI with LOW VRAM support☆2,213Updated this week
- Video Generation Foundation Models: https://saiyan-world.github.io/goku/☆2,746Updated last month
- ☆1,953Updated 4 months ago
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,781Updated 4 months ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,246Updated 6 months ago