jy0205 / Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
☆2,219 · Updated last week
Related projects
Alternatives and complementary repositories for Pyramid-Flow
- The best OSS video generation models ☆1,848 · Updated this week
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,070 · Updated 3 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment ☆2,578 · Updated last week
- ☆1,595 · Updated this week
- A general fine-tuning kit geared toward diffusion models. ☆1,778 · Updated this week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance ☆1,874 · Updated last month
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation ☆1,669 · Updated last week
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images. ☆2,247 · Updated 3 weeks ago
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340 ☆2,222 · Updated this week
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text ☆1,413 · Updated 2 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 ☆1,659 · Updated last month
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors ☆2,573 · Updated 2 months ago
- Various AI scripts. Mostly Stable Diffusion stuff. ☆3,331 · Updated last week
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA ☆1,396 · Updated last month
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆1,301 · Updated this week
- ☆613 · Updated this week
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation ☆2,255 · Updated 3 months ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te… ☆556 · Updated last week
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling" ☆1,306 · Updated last month
- Dead simple FLUX LoRA training UI with LOW VRAM support ☆1,258 · Updated last week
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion … ☆1,440 · Updated 2 months ago
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment ☆1,086 · Updated 3 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,689 · Updated last month
- Transparent Image Layer Diffusion using Latent Transparency ☆2,017 · Updated 4 months ago
- Kolors Team ☆3,826 · Updated 2 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆2,791 · Updated last week
- 👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing ☆1,031 · Updated 3 weeks ago
- Text-to-Music Generation with Rectified Flow Transformers ☆1,592 · Updated 2 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. ☆622 · Updated 3 months ago
- ☆562 · Updated this week