black-forest-labs / flux
Official inference repo for FLUX.1 models
☆15,956 · Updated this week
Related projects
Alternatives and complementary repositories for flux
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper ☆5,955 · Updated last month
- Various AI scripts. Mostly Stable Diffusion stuff. ☆3,408 · Updated 3 weeks ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ☆57,116 · Updated this week
- ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable,… ☆6,966 · Updated this week
- ☆8,526 · Updated this week
- More relighting! ☆5,545 · Updated 3 weeks ago
- Bring portraits to life! ☆13,012 · Updated last week
- Zero-Shot Speech Editing and Text-to-Speech in the Wild ☆7,645 · Updated 4 months ago
- Open-Sora: Democratizing Efficient Video Production for All ☆22,289 · Updated this week
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment ☆2,637 · Updated 2 weeks ago
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation ☆5,365 · Updated 2 months ago
- Enjoy the magic of Diffusion models! ☆6,589 · Updated this week
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340 ☆2,769 · Updated this week
- StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, … ☆4,588 · Updated 3 months ago
- Your image is almost there! ☆7,334 · Updated 3 months ago
- [WIP] Layer Diffusion for WebUI (via Forge) ☆3,885 · Updated 2 months ago
- Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required. ☆6,919 · Updated this week
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio… ☆7,647 · Updated this week
- Inference and training library for high-quality TTS models. ☆4,658 · Updated 3 weeks ago
- ☆1,625 · Updated last week
- Kolors Team ☆3,872 · Updated last week
- Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆9,204 · Updated this week
- A general fine-tuning kit geared toward diffusion models. ☆1,811 · Updated this week
- Brand new TTS solution ☆14,572 · Updated this week
- SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible,… ☆1,425 · Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆47,259 · Updated this week
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling ☆2,340 · Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆7,251 · Updated this week
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆4,652 · Updated 4 months ago
- Official implementation of AnimateDiff. ☆10,603 · Updated 3 months ago