☆7,843Apr 14, 2024Updated last year
Alternatives and similar repositories for IF
Users that are interested in IF are comparing it to the libraries listed below
Sorting:
- Let us control diffusion models!☆33,663Feb 25, 2024Updated 2 years ago
- StableLM: Stability AI Language Models☆15,761Apr 8, 2024Updated last year
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head☆10,207Jul 6, 2024Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆32,873Updated this week
- Official repo for consistency models.☆6,477Mar 22, 2024Updated last year
- Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.☆8,808Dec 10, 2023Updated 2 years ago
- T2I-Adapter☆3,793Jun 21, 2024Updated last year
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆7,525Mar 22, 2024Updated last year
- Generative Models by Stability AI☆26,930Dec 16, 2025Updated 2 months ago
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆5,032Jan 9, 2026Updated last month
- Kandinsky 2 — multilingual text2image latent diffusion model☆2,821May 1, 2024Updated last year
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,760Sep 2, 2024Updated last year
- Consistency Distilled Diff VAE☆2,209Nov 7, 2023Updated 2 years ago
- Official implementation of AnimateDiff.☆12,038Jul 31, 2024Updated last year
- ☆6,881Mar 3, 2024Updated last year
- ImageBind One Embedding Space to Bind Them All☆8,980Nov 21, 2025Updated 3 months ago
- 🔊 Text-Prompted Generative Audio Model☆39,006Aug 19, 2024Updated last year
- Community interface for generative AI☆9,057Apr 30, 2024Updated last year
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …☆17,409Sep 5, 2024Updated last year
- Generate 3D objects conditioned on text or images☆12,209Jun 22, 2024Updated last year
- Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI…☆6,939Dec 13, 2025Updated 2 months ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,560Dec 26, 2023Updated 2 years ago
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆53,497Sep 18, 2024Updated last year
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,478Aug 12, 2024Updated last year
- Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)☆3,431Feb 23, 2025Updated last year
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…☆37,444Aug 17, 2024Updated last year
- A unified framework for 3D content generation.☆6,981Dec 16, 2024Updated last year
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,615Jun 14, 2024Updated last year
- [ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators☆4,244May 6, 2023Updated 2 years ago
- Official Code for DragGAN (SIGGRAPH 2023)☆35,979May 18, 2024Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,476Jun 7, 2025Updated 8 months ago
- ☆3,049Feb 27, 2023Updated 3 years ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,471Jun 28, 2024Updated last year
- [NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"☆4,772Aug 19, 2024Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,281Oct 31, 2024Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,414Jun 2, 2025Updated 8 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,167Nov 18, 2024Updated last year
- [CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing☆4,864Apr 7, 2024Updated last year
- PyTorch code and models for the DINOv2 self-supervised learning method.☆12,427Updated this week