VectorSpaceLab / OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
☆3,868 Updated last month
Alternatives and similar repositories for OmniGen:
Users who are interested in OmniGen are comparing it to the libraries listed below.
- Official repository for LTX-Video ☆3,221 Updated 3 weeks ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆2,098 Updated 3 weeks ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment ☆3,222 Updated 4 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆3,796 Updated this week
- SkyReels V1: The first and most advanced open-source human-centric video foundation model ☆1,915 Updated 3 weeks ago
- ☆1,967 Updated 4 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance ☆2,282 Updated 6 months ago
- ☆2,292 Updated 3 weeks ago
- The best OSS video generation models ☆3,056 Updated 2 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling ☆2,862 Updated 3 months ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model ☆9,454 Updated 3 weeks ago
- A general fine-tuning kit geared toward diffusion models. ☆2,161 Updated this week
- A minimal and universal controller for FLUX.1. ☆1,348 Updated 3 weeks ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo ☆1,213 Updated this week
- Dead simple FLUX LoRA training UI with LOW VRAM support ☆2,251 Updated last week
- Kolors Team ☆4,308 Updated 4 months ago
- ☆1,103 Updated 2 months ago
- [CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System ☆3,234 Updated last month
- Taming Stable Diffusion for Lip Sync! ☆3,424 Updated last week
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,170 Updated last month
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation ☆1,782 Updated 5 months ago
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling" ☆1,471 Updated 2 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆3,017 Updated 5 months ago
- [CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation ☆2,033 Updated this week
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images. ☆786 Updated this week
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA ☆1,548 Updated 6 months ago
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text ☆1,523 Updated this week
- Official repository of In-Context LoRA for Diffusion Transformers ☆1,736 Updated 3 months ago
- ☆2,259 Updated 7 months ago
- ☆2,734 Updated 2 weeks ago