sayakpaul / caption-upsampling
This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
☆152Updated last year
Alternatives and similar repositories for caption-upsampling:
Users that are interested in caption-upsampling are comparing it to the libraries listed below
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆175Updated last year
- ☆321Updated 7 months ago
- Merge safetensor files using the technique described in "Language Models are Super Mario: Absorbing Abilities from Homologous Models as a…☆79Updated 6 months ago
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models☆172Updated last year
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆86Updated last year
- ☆118Updated 2 years ago
- ☆427Updated last year
- Updated 2 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆420Updated last year
- ☆127Updated 7 months ago
- Faster generation with text-to-image diffusion models.☆213Updated 7 months ago
- ☆52Updated 2 years ago
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"☆242Updated last year
- Diffusion Reinforcement Learning Library☆184Updated last year
- ☆125Updated 2 months ago
- This is a Gradio WebUI working with the Diffusers format of Stable Diffusion☆80Updated 2 years ago
- ☆73Updated 2 years ago
- Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'☆255Updated last year
- Official Implementation for "ConceptLab: Creative Generation using Diffusion Prior Constraints"☆251Updated last year
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.☆87Updated 2 years ago
- sd3 dreambooth lora training book, adapted from the diffusers doc☆45Updated 10 months ago
- ☆185Updated last year
- Official Implementation of weights2weights☆141Updated 2 months ago
- Various training scripts used to train bigasp☆81Updated 6 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- ☆86Updated 2 years ago
- Diffusers Implementation of Controlling Text-to-Image Diffusion by Orthogonal Finetuning☆35Updated last year
- ☆54Updated 2 years ago
- Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"☆350Updated last year
- A detailed diagram laying out the full Flux.1 [dev] architecture as shared by Black Forest Labs at https://github.com/black-forest-labs/f…☆53Updated 6 months ago