ostris / ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
☆3,408Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for ai-toolkit
- A general fine-tuning kit geared toward diffusion models.☆1,811Updated this week
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆2,637Updated 2 weeks ago
- Dead simple FLUX LoRA training UI with LOW VRAM support☆1,325Updated this week
- ☆1,625Updated last week
- ☆1,119Updated 3 weeks ago
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,340Updated this week
- The best OSS video generation models☆2,050Updated this week
- OneTrainer is a one-stop solution for all your stable diffusion training needs.☆1,786Updated this week
- ComfyUI nodes for LivePortrait☆1,648Updated 3 months ago
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆2,769Updated this week
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,086Updated 3 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,589Updated 2 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆1,676Updated 2 months ago
- ☆4,148Updated 2 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,693Updated last month
- Character Animation (AnimateAnyone, Face Reenactment)☆3,185Updated 5 months ago
- ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable,…☆6,966Updated this week
- SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible,…☆1,425Updated this week
- An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc…☆2,371Updated last week
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"☆1,319Updated last month
- A powerful tool that translates ComfyUI workflows into executable Python code.☆1,240Updated 2 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆2,809Updated 3 weeks ago
- ☆1,873Updated 3 months ago
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,278Updated 3 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆1,490Updated this week
- Examples of ComfyUI workflows☆1,960Updated 2 weeks ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆5,955Updated last month
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,679Updated 3 weeks ago
- ControlNet++: All-in-one ControlNet for image generations and editing!☆1,758Updated last month
- GGUF Quantization support for native ComfyUI models☆1,039Updated 2 weeks ago