segmind / SSD-1B
SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.
☆166Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for SSD-1B
- ☆406Updated 7 months ago
- IP Adapter Instruct☆182Updated 3 months ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆153Updated last year
- ☆317Updated last month
- ☆393Updated 7 months ago
- Training-free Regional Prompting for Diffusion Transformers 🔥☆267Updated this week
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆84Updated 10 months ago
- Keyframe Interpolation with CogvideoX☆79Updated last week
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models☆165Updated last year
- Official Repository of the paper "Trajectory Consistency Distillation"☆318Updated 6 months ago
- Official Implementation for "ConceptLab: Creative Generation using Diffusion Prior Constraints"☆245Updated 10 months ago
- Implicit Style-Content Separation using B-LoRA☆298Updated last month
- ☆110Updated 2 years ago
- ☆245Updated 10 months ago
- [SIGGRAPH Asia 2024 (Journal Track)]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter☆193Updated 3 months ago
- Memory optimized finetuning scripts for CogVideoX using TorchAO and DeepSpeed☆345Updated this week
- ☆122Updated last month
- ☆262Updated 3 months ago
- Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'☆253Updated last year
- implementation of the IPAdapter models for HF Diffusers☆165Updated last year
- JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for th…☆119Updated 3 weeks ago
- Forked version of AnimateDiff, attempts to add init images. If you are look into original repo, please go to https://github.com/guoyww/a…☆154Updated last year
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆190Updated 3 months ago
- ☆182Updated last year
- a CLI utility/library for AnimateDiff stable diffusion generation☆263Updated this week
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"☆238Updated 7 months ago
- Reference-Based Modulation (RB-Modulation)☆124Updated 2 months ago
- Faster LCM is a script which enables to transfer image styles at 45fps with RTX4090, 33fps with A100.☆93Updated 11 months ago
- The best OSS video generation models☆117Updated 2 weeks ago