🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
★ 33,005 · Mar 12, 2026 · Updated this week
Alternatives and similar repositories for diffusers
Users who are interested in diffusers compare it to the libraries listed below.
- Let us control diffusion models! ★ 33,723 · Feb 25, 2024 · Updated 2 years ago
- A latent text-to-image diffusion model ★ 72,681 · Jun 18, 2024 · Updated last year
- High-Resolution Image Synthesis with Latent Diffusion Models ★ 13,906 · Feb 29, 2024 · Updated 2 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image ★ 32,781 · Feb 18, 2026 · Updated 3 weeks ago
- Generative Models by Stability AI ★ 27,000 · Dec 16, 2025 · Updated 3 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" ★ 8,410 · May 31, 2024 · Updated last year
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ★ 157,783 · Updated this week
- Using Low-rank adaptation to quickly fine-tune diffusion models. ★ 7,529 · Mar 22, 2024 · Updated last year
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi… ★ 53,627 · Sep 18, 2024 · Updated last year
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ★ 9,545 · Updated this week
- A collection of resources and papers on Diffusion Models ★ 12,288 · Aug 1, 2024 · Updated last year
- Hackable and optimized Transformers building blocks, supporting a composable construction. ★ 10,364 · Feb 20, 2026 · Updated 3 weeks ago
- An open source implementation of CLIP. ★ 13,496 · Updated this week
- ★ 7,315 · Jul 2, 2024 · Updated last year
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ★ 20,761 · Mar 10, 2026 · Updated last week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ★ 24,543 · Aug 12, 2024 · Updated last year
- Stable Diffusion web UI ★ 161,629 · Mar 2, 2026 · Updated 2 weeks ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ★ 41,807 · Updated this week
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --… ★ 36,504 · Updated this week
- Fast and memory-efficient exact attention ★ 22,719 · Updated this week
- Official implementation of AnimateDiff. ★ 12,059 · Jul 31, 2024 · Updated last year
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! ★ 42,001 · Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ★ 22,046 · Jan 23, 2026 · Updated last month
- Official inference repo for FLUX.1 models ★ 25,274 · Jul 31, 2025 · Updated 7 months ago
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and … ★ 17,450 · Sep 5, 2024 · Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt. ★ 6,485 · Jun 28, 2024 · Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligence ★ 11,183 · Nov 18, 2024 · Updated last year
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ★ 12,515 · Nov 4, 2025 · Updated 4 months ago
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion ★ 7,746 · Dec 8, 2022 · Updated 3 years ago
- Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion. ★ 8,819 · Dec 10, 2023 · Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ★ 72,827 · Updated this week
- Open-Sora: Democratizing Efficient Video Production for All ★ 28,687 · Apr 30, 2025 · Updated 10 months ago
- Official repo for consistency models. ★ 6,476 · Mar 22, 2024 · Updated last year
- Making large AI models cheaper, faster and more accessible ★ 41,360 · Mar 9, 2026 · Updated last week
- ★ 3,443 · May 14, 2024 · Updated last year
- Inference code for Llama models ★ 59,221 · Jan 26, 2025 · Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ★ 105,651 · Updated this week
- ★ 6,886 · Mar 3, 2024 · Updated 2 years ago
- T2I-Adapter ★ 3,802 · Jun 21, 2024 · Updated last year