StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
☆10,772Dec 4, 2024Updated last year
Alternatives and similar repositories for StreamDiffusion
Users that are interested in StreamDiffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of AnimateDiff.☆12,162Jul 31, 2024Updated last year
- PhotoMaker [CVPR 2024]☆10,098Oct 31, 2024Updated last year
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,229Apr 8, 2024Updated 2 years ago
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,856Mar 7, 2025Updated last year
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,616Jun 14, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"☆10,908Aug 29, 2025Updated 9 months ago
- ☆8,681Oct 9, 2024Updated last year
- Generative Models by Stability AI☆27,205Dec 16, 2025Updated 6 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,955Jul 18, 2024Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,156Jan 10, 2025Updated last year
- Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation☆14,773Sep 20, 2025Updated 9 months ago
- Official Code for Stable Cascade☆6,548Jul 25, 2024Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆36,789Apr 19, 2025Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆118,143Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Open-Sora: Democratizing Efficient Video Production for All☆29,143Apr 9, 2026Updated 2 months ago
- Unofficial Implementation of Animate Anyone☆2,926Jul 9, 2024Updated last year
- Let us control diffusion models!☆33,965Feb 25, 2024Updated 2 years ago
- Your image is almost there!☆7,612Jul 26, 2024Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,604Jun 28, 2024Updated 2 years ago
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,159Mar 8, 2026Updated 3 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,840Feb 1, 2025Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆5,063Jan 9, 2026Updated 5 months ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,430Sep 26, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Enjoy the magic of Diffusion models!☆12,609Jun 21, 2026Updated last week
- Focus on prompting and generating☆50,450Dec 1, 2025Updated 6 months ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,508May 31, 2024Updated 2 years ago
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆9,496Jun 6, 2025Updated last year
- Industry leading face manipulation platform☆29,071Updated this week
- 🔊 Text-Prompted Generative Audio Model☆39,172Aug 19, 2024Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆23,393Mar 3, 2026Updated 3 months ago
- App showcasing multiple real-time diffusion models pipelines with Diffusers☆917Sep 27, 2025Updated 9 months ago
- High-speed Large Language Model Serving for Local Deployment☆9,586May 11, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation of DreaMoving☆1,791Jan 9, 2024Updated 2 years ago
- More relighting!☆8,450Feb 20, 2025Updated last year
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,865Mar 25, 2026Updated 3 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,881Aug 12, 2024Updated last year
- Code and dataset for photorealistic Codec Avatars driven from audio☆2,857Sep 15, 2024Updated last year
- Let us democratise high-resolution generation! (CVPR 2024)☆2,040Oct 10, 2025Updated 8 months ago
- [WIP] Layer Diffusion for WebUI (via Forge)☆4,114Aug 30, 2024Updated last year