VectorSpaceLab / OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
☆4,219 Updated last month
Alternatives and similar repositories for OmniGen
Users who are interested in OmniGen are comparing it to the repositories listed below.
- ☆2,324 Updated last month
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling ☆3,001 Updated 6 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆4,365 Updated 2 weeks ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆2,181 Updated 4 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment ☆3,416 Updated 2 months ago
- Official implementations for paper: VACE: All-in-One Video Creation and Editing ☆2,930 Updated 2 months ago
- The best OSS video generation models ☆3,298 Updated 6 months ago
- [CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System ☆3,493 Updated 3 months ago
- ☆2,135 Updated 8 months ago
- SkyReels V1: The first and most advanced open-source human-centric video foundation model ☆2,226 Updated 4 months ago
- Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persisten… ☆1,831 Updated 2 months ago
- ☆2,501 Updated 2 months ago
- MAGI-1: Autoregressive Video Generation at Scale ☆3,378 Updated last month
- A general fine-tuning kit geared toward diffusion models. ☆2,440 Updated this week
- Official repository for LTX-Video ☆7,022 Updated last week
- Dead simple FLUX LoRA training UI with LOW VRAM support ☆2,815 Updated 3 months ago
- [ICCV 2025] OminiControl: Minimal and Universal Control for Diffusion Transformer ☆1,692 Updated 2 weeks ago
- Official repository of In-Context LoRA for Diffusion Transformers ☆1,957 Updated 6 months ago
- LTX-Video Support for ComfyUI ☆2,165 Updated last week
- ☆1,244 Updated 6 months ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo ☆1,574 Updated last month
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,204 Updated 5 months ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment ☆1,299 Updated 3 weeks ago
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis ☆1,726 Updated 2 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance ☆2,403 Updated 3 weeks ago
- Open-source unified multimodal model ☆4,604 Updated 2 weeks ago
- Wan 2.1 for the GPU Poor ☆1,652 Updated this week
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis ☆1,444 Updated last week
- SkyReels-V2: Infinite-length Film Generative model ☆3,439 Updated 3 weeks ago
- ☆3,070 Updated 4 months ago