sayakpaul / caption-upsampling
This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
☆153Updated last year
Alternatives and similar repositories for caption-upsampling:
Users that are interested in caption-upsampling are comparing it to the libraries listed below
- ☆322Updated 5 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆84Updated last year
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆169Updated 10 months ago
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models☆167Updated last year
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"☆239Updated 11 months ago
- ☆118Updated 2 years ago
- A diffusers based implementation of HyperDreamBooth☆128Updated last year
- ☆60Updated 9 months ago
- ☆125Updated 4 months ago
- Updated last year
- Code for instruction-tuning Stable Diffusion.☆221Updated last year
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆527Updated last year
- ☆416Updated 10 months ago
- ☆125Updated 2 months ago
- ☆54Updated last year
- ☆173Updated 10 months ago
- ☆183Updated last year
- Official Implementation of weights2weights☆138Updated 2 months ago
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.☆88Updated 2 years ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆410Updated last year
- ☆52Updated last year
- Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'☆255Updated last year
- Diffusion Reinforcement Learning Library☆179Updated last year
- Faster generation with text-to-image diffusion models.☆210Updated 4 months ago
- Fast finetuning using a booster model that puts the initial state to a local minimum☆113Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts.☆44Updated 2 months ago
- ☆86Updated last year
- Various training scripts used to train bigasp☆76Updated 3 months ago
- Diffusers Implementation of Controlling Text-to-Image Diffusion by Orthogonal Finetuning☆35Updated last year