ostris / ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
☆3,817 Updated 2 weeks ago
Alternatives and similar repositories for ai-toolkit:
Users who are interested in ai-toolkit are comparing it to the libraries listed below.
- A general fine-tuning kit geared toward diffusion models. ☆2,005 Updated this week
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment ☆2,982 Updated last month
- ☆1,822 Updated 2 months ago
- Dead simple FLUX LoRA training UI with LOW VRAM support ☆1,726 Updated last week
- Official repository for LTX-Video ☆2,562 Updated 2 weeks ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt. ☆5,547 Updated 6 months ago
- OneTrainer is a one-stop solution for all your stable diffusion training needs. ☆1,943 Updated this week
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆2,407 Updated this week
- ☆1,279 Updated 2 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆2,931 Updated 2 months ago
- ☆4,476 Updated 4 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,126 Updated 5 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 ☆1,734 Updated 3 months ago
- Examples of ComfyUI workflows ☆2,255 Updated last month
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper ☆6,116 Updated 3 months ago
- The best OSS video generation models ☆2,718 Updated last week
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation ☆1,738 Updated 2 months ago
- Kolors Team ☆4,108 Updated 2 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,739 Updated 3 weeks ago
- StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, … ☆4,669 Updated 5 months ago
- A powerful tool that translates ComfyUI workflows into executable Python code. ☆1,413 Updated this week
- ControlNet++: All-in-one ControlNet for image generations and editing! ☆1,833 Updated 3 months ago
- ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable,… ☆7,950 Updated this week
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling ☆2,701 Updated 3 weeks ago
- Official repository of In-Context LoRA for Diffusion Transformers ☆1,480 Updated 3 weeks ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors ☆2,717 Updated 4 months ago
- SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible,… ☆1,798 Updated this week
- ☆1,272 Updated this week
- An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc… ☆2,589 Updated last month
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆1,680 Updated this week