π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
β32,873Updated this week
Alternatives and similar repositories for diffusers
Users that are interested in diffusers are comparing it to the libraries listed below
Sorting:
- Let us control diffusion models!β33,663Feb 25, 2024Updated 2 years ago
- A latent text-to-image diffusion modelβ72,575Jun 18, 2024Updated last year
- High-Resolution Image Synthesis with Latent Diffusion Modelsβ13,853Feb 29, 2024Updated 2 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an imageβ32,642Feb 18, 2026Updated last week
- Generative Models by Stability AIβ26,930Dec 16, 2025Updated 2 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"β8,382May 31, 2024Updated last year
- π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal modelβ¦β157,071Updated this week
- Using Low-rank adaptation to quickly fine-tune diffusion models.β7,525Mar 22, 2024Updated last year
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoiβ¦β53,497Sep 18, 2024Updated last year
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,513Updated this week
- A collection of resources and papers on Diffusion Modelsβ12,273Aug 1, 2024Updated last year
- Hackable and optimized Transformers building blocks, supporting a composable construction.β10,353Feb 20, 2026Updated last week
- An open source implementation of CLIP.β13,397Feb 20, 2026Updated last week
- β7,305Jul 2, 2024Updated last year
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β20,678Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.β24,478Aug 12, 2024Updated last year
- Stable Diffusion web UIβ161,110Dec 18, 2025Updated 2 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β41,648Updated this week
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --β¦β36,397Updated this week
- Fast and memory-efficient exact attentionβ22,361Updated this week
- Official implementation of AnimateDiff.β12,025Jul 31, 2024Updated last year
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β41,855Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ22,033Jan 23, 2026Updated last month
- Official inference repo for FLUX.1 modelsβ25,225Jul 31, 2025Updated 6 months ago
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and β¦β17,409Sep 5, 2024Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.β6,471Jun 28, 2024Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligenceβ11,167Nov 18, 2024Updated last year
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)β12,449Nov 4, 2025Updated 3 months ago
- Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.β8,808Dec 10, 2023Updated 2 years ago
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusionβ7,751Dec 8, 2022Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMsβ71,234Updated this week
- Open-Sora: Democratizing Efficient Video Production for Allβ28,604Apr 30, 2025Updated 10 months ago
- Official repo for consistency models.β6,477Mar 22, 2024Updated last year
- Making large AI models cheaper, faster and more accessibleβ41,359Updated this week
- β3,438May 14, 2024Updated last year
- Inference code for Llama modelsβ59,166Jan 26, 2025Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.β104,246Updated this week
- β6,881Mar 3, 2024Updated last year
- T2I-Adapterβ3,793Jun 21, 2024Updated last year