sayakpaul / caption-upsamplingLinks
This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
☆156Updated last year
Alternatives and similar repositories for caption-upsampling
Users that are interested in caption-upsampling are comparing it to the libraries listed below
Sorting:
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆177Updated last year
- ☆61Updated last year
- ☆321Updated last year
- ☆52Updated 2 years ago
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"☆245Updated last year
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆86Updated last year
- ☆86Updated 2 years ago
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models☆175Updated 2 years ago
- ☆435Updated last year
- Diffusion Reinforcement Learning Library☆191Updated last year
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.☆65Updated last year
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆235Updated 2 years ago
- A diffusers based implementation of HyperDreamBooth☆137Updated 2 years ago
- ☆118Updated 3 years ago
- ☆128Updated 11 months ago
- Fast finetuning using a booster model that puts the initial state to a local minimum☆114Updated 2 years ago
- ☆73Updated 2 years ago
- Faster generation with text-to-image diffusion models.☆226Updated 2 months ago
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆545Updated last year
- Official Implementation for "ConceptLab: Creative Generation using Diffusion Prior Constraints"☆255Updated last year
- Fine-tuning of diffusion models☆99Updated 2 years ago
- ☆126Updated 6 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆426Updated 2 years ago
- ControlNet control image preprocess library☆15Updated 2 years ago
- Iterable datapipelines for pytorch training.☆87Updated last year
- Tiny optimized Stable-diffusion that can run on GPUs with just 1GB of VRAM. (Beta)☆176Updated 2 years ago
- [inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆13Updated last year
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.☆87Updated 2 years ago
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested.☆53Updated last year
- Apply controlnet to video clips☆81Updated 10 months ago