VectorSpaceLab / OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
☆3,610Updated this week
Alternatives and similar repositories for OmniGen:
Users that are interested in OmniGen are comparing it to the libraries listed below
- Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System☆2,912Updated last month
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,103Updated 2 months ago
- [arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,096Updated this week
- Official repository of In-Context LoRA for Diffusion Transformers☆1,595Updated 2 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆3,413Updated last week
- ☆1,889Updated 3 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆1,987Updated this week
- Official repository for LTX-Video☆2,857Updated this week
- Taming Stable Diffusion for Lip Sync!☆2,583Updated last month
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,204Updated 4 months ago
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,784Updated 2 months ago
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,762Updated 3 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,156Updated this week
- Kolors Team☆4,195Updated 3 months ago
- ☆1,897Updated this week
- A general fine-tuning kit geared toward diffusion models.☆2,092Updated last week
- A minimal and universal controller for FLUX.1.☆1,214Updated 3 weeks ago
- EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆2,832Updated 3 weeks ago
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,503Updated 2 months ago
- Dead simple FLUX LoRA training UI with LOW VRAM support☆2,004Updated last month
- ☆2,174Updated 6 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,777Updated 5 months ago
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆3,913Updated last month
- ComfyUI nodes for LivePortrait☆1,837Updated 6 months ago
- ☆983Updated last month
- Transparent Image Layer Diffusion using Latent Transparency☆2,071Updated 8 months ago
- [ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2)…☆1,188Updated this week
- Improved AnimateDiff for ComfyUI and Advanced Sampling Support☆2,974Updated this week
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆2,971Updated 3 months ago
- ☆4,638Updated this week