π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
β33,085Mar 18, 2026Updated this week
Alternatives and similar repositories for diffusers
Users that are interested in diffusers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Let us control diffusion models!β33,752Feb 25, 2024Updated 2 years ago
- A latent text-to-image diffusion modelβ72,709Jun 18, 2024Updated last year
- High-Resolution Image Synthesis with Latent Diffusion Modelsβ13,924Feb 29, 2024Updated 2 years ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"β8,433May 31, 2024Updated last year
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an imageβ32,861Feb 18, 2026Updated last month
- Using Low-rank adaptation to quickly fine-tune diffusion models.β7,529Mar 22, 2024Updated 2 years ago
- Generative Models by Stability AIβ27,024Dec 16, 2025Updated 3 months ago
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,563Updated this week
- A collection of resources and papers on Diffusion Modelsβ12,297Aug 1, 2024Updated last year
- β7,318Jul 2, 2024Updated last year
- π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal modelβ¦β158,060Updated this week
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoiβ¦β53,684Sep 18, 2024Updated last year
- Hackable and optimized Transformers building blocks, supporting a composable construction.β10,373Updated this week
- An open source implementation of CLIP.β13,528Mar 12, 2026Updated last week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β20,841Updated this week
- Stable Diffusion web UIβ161,958Mar 2, 2026Updated 3 weeks ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.β24,603Aug 12, 2024Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.β6,502Jun 28, 2024Updated last year
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --β¦β36,538Updated this week
- Fast and memory-efficient exact attentionβ22,832Updated this week
- Official implementation of AnimateDiff.β12,067Jul 31, 2024Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β41,869Updated this week
- β3,444May 14, 2024Updated last year
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and β¦β17,472Sep 5, 2024Updated last year
- Official inference repo for FLUX.1 modelsβ25,311Jul 31, 2025Updated 7 months ago
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusionβ7,745Dec 8, 2022Updated 3 years ago
- LAVIS - A One-stop Library for Language-Vision Intelligenceβ11,189Nov 18, 2024Updated last year
- Official repo for consistency models.β6,474Mar 22, 2024Updated 2 years ago
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β42,053Updated this week
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)β12,532Nov 4, 2025Updated 4 months ago
- T2I-Adapterβ3,803Jun 21, 2024Updated last year
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ22,059Jan 23, 2026Updated 2 months ago
- Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.β8,820Dec 10, 2023Updated 2 years ago
- Open-Sora: Democratizing Efficient Video Production for Allβ28,728Apr 30, 2025Updated 10 months ago
- β6,887Mar 3, 2024Updated 2 years ago
- Taming Transformers for High-Resolution Image Synthesisβ6,451Jul 30, 2024Updated last year
- β3,051Feb 27, 2023Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMsβ73,479Updated this week
- β6,956Updated this week