genmoai / models
The best OSS video generation models
☆1,848 · Updated this week
Related projects
Alternatives and complementary repositories for models
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling ☆2,219 · Updated last week
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation ☆1,669 · Updated last week
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text ☆1,413 · Updated 2 months ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA ☆1,396 · Updated last month
- ☆1,595 · Updated this week
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling" ☆1,306 · Updated last month
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te… ☆556 · Updated last week
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,070 · Updated 3 months ago
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340 ☆2,222 · Updated this week
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆1,301 · Updated this week
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors ☆2,573 · Updated 2 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance ☆1,874 · Updated last month
- ☆613 · Updated this week
- A general fine-tuning kit geared toward diffusion models. ☆1,778 · Updated this week
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment ☆2,578 · Updated last week
- ☆562 · Updated this week
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 ☆1,659 · Updated last month
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆2,791 · Updated last week
- VideoSys: An easy and efficient system for video generation ☆1,759 · Updated this week
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. ☆622 · Updated 3 months ago
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment ☆1,086 · Updated 3 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆560 · Updated this week
- Official Code for MotionCtrl [SIGGRAPH 2024] ☆1,322 · Updated last month
- ☆784 · Updated this week
- Create images of a given character in different poses ☆580 · Updated 5 months ago
- Select a portrait, click to move the head around (please use your own space / GPU!) ☆711 · Updated 3 weeks ago
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images. ☆2,247 · Updated 3 weeks ago
- Fine-Grained Open Domain Image Animation with Motion Guidance ☆781 · Updated 3 weeks ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models ☆875 · Updated 2 months ago