Stability-AI / generative-models
Generative Models by Stability AI
★25,730, updated 3 weeks ago
Alternatives and similar repositories for generative-models:
Users who are interested in generative-models are comparing it to the libraries listed below.
- Let us control diffusion models! (★32,125, updated last year)
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX. (★28,700, updated this week)
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation (★10,148, updated 4 months ago)
- Official implementation of AnimateDiff. (★11,319, updated 8 months ago)
- A Gradio web UI for Large Language Models with support for multiple inference backends. (★43,274, updated this week)
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models (★4,807, updated 9 months ago)
- [CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model (★10,720, updated 10 months ago)
- FaceChain is a deep-learning toolchain for generating your Digital-Twin. (★9,383, updated 2 weeks ago)
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥 (★11,575, updated 9 months ago)
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. (★74,999, updated this week)
- Nightly release of ControlNet 1.1 (★4,972, updated 8 months ago)
- Focus on prompting and generating (★44,415, updated 3 months ago)
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt. (★5,874, updated 9 months ago)
- (★10,509, updated this week)
- Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation (★14,725, updated 2 months ago)
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) (★47,518, updated this week)
- Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts (★4,569, updated 7 months ago)
- StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, … (★4,761, updated last month)
- Using Low-rank adaptation to quickly fine-tune diffusion models. (★7,314, updated last year)
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. (★22,283, updated 8 months ago)
- 🔊 Text-Prompted Generative Audio Model (★37,539, updated 8 months ago)
- [ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators (★4,176, updated last year)
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! (★37,616, updated this week)
- WebUI extension for ControlNet (★17,542, updated 8 months ago)
- Open-Sora: Democratizing Efficient Video Production for All (★26,205, updated 3 weeks ago)
- High-Resolution Image Synthesis with Latent Diffusion Models (★40,820, updated 6 months ago)
- Hackable and optimized Transformers building blocks, supporting a composable construction. (★9,372, updated last week)
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)… (★13,531, updated 2 weeks ago)
- T2I-Adapter (★3,661, updated 10 months ago)
- [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation (★12,614, updated 9 months ago)