VectorSpaceLab / OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
☆2,975Updated 2 weeks ago
Alternatives and similar repositories for OmniGen:
Users that are interested in OmniGen are comparing it to the libraries listed below
- The best OSS video generation models☆2,253Updated this week
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆1,550Updated this week
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆2,726Updated this week
- ☆1,681Updated 3 weeks ago
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,493Updated this week
- Dead simple FLUX LoRA training UI with LOW VRAM support☆1,394Updated 2 weeks ago
- Official repository of In-Context LoRA for Diffusion Transformers☆1,217Updated 2 weeks ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆1,210Updated this week
- A general fine-tuning kit geared toward diffusion models.☆1,847Updated last week
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,094Updated 3 months ago
- Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System☆1,924Updated last week
- ☆1,943Updated 3 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆1,950Updated 2 months ago
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,441Updated this week
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,631Updated 2 months ago
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,695Updated last month
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"☆1,353Updated last week
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA☆1,419Updated 2 months ago
- ☆1,154Updated last month
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,293Updated 3 months ago
- Various AI scripts. Mostly Stable Diffusion stuff.☆3,512Updated this week
- GGUF Quantization support for native ComfyUI models☆1,095Updated last week
- ComfyUI nodes for LivePortrait☆1,676Updated 3 months ago
- Kolors Team☆3,924Updated 2 weeks ago
- ☆1,072Updated this week
- Official repository for LTX-Video☆1,645Updated last week
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆1,686Updated 2 months ago
- ☆663Updated 3 weeks ago
- EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆1,396Updated this week
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆2,859Updated last month