VectorSpaceLab / OmniGenLinks
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
☆4,144Updated last week
Alternatives and similar repositories for OmniGen
Users that are interested in OmniGen are comparing it to the libraries listed below
Sorting:
- ☆2,214Updated this week
- The best OSS video generation models☆3,219Updated 5 months ago
- A general fine-tuning kit geared toward diffusion models.☆2,377Updated last week
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,279Updated 2 weeks ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,170Updated 3 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,396Updated last month
- Official implementations for paper: VACE: All-in-One Video Creation and Editing☆2,648Updated last month
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…☆1,426Updated this week
- Dead simple FLUX LoRA training UI with LOW VRAM support☆2,689Updated 2 months ago
- Official repository for LTX-Video☆6,745Updated 3 weeks ago
- A minimal and universal controller for FLUX.1.☆1,639Updated 2 weeks ago
- Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persisten…☆1,718Updated last month
- The ultimate training toolkit for finetuning diffusion models☆4,875Updated this week
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆2,207Updated 3 months ago
- ☆2,113Updated 7 months ago
- Kolors Team☆4,462Updated 7 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,972Updated 6 months ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,507Updated last month
- Official repository of In-Context LoRA for Diffusion Transformers☆1,919Updated 6 months ago
- ☆2,472Updated last month
- GGUF Quantization support for native ComfyUI models☆2,082Updated last week
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,196Updated 2 weeks ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,198Updated 4 months ago
- MAGI-1: Autoregressive Video Generation at Scale☆3,302Updated this week
- LTX-Video Support for ComfyUI☆2,080Updated this week
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,113Updated 7 months ago
- 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,128Updated 2 months ago
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,168Updated 5 months ago
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,816Updated 7 months ago
- ☆2,410Updated 10 months ago