sayakpaul / caption-upsampling
This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
☆152Updated last year
Alternatives and similar repositories for caption-upsampling:
Users that are interested in caption-upsampling are comparing it to the libraries listed below
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆172Updated 11 months ago
- ☆321Updated 6 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆85Updated last year
- ☆426Updated 11 months ago
- ☆126Updated 5 months ago
- Updated 3 weeks ago
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models☆169Updated last year
- ☆52Updated 2 years ago
- ☆118Updated 2 years ago
- ☆22Updated last year
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.☆61Updated last year
- ☆90Updated last year
- ☆86Updated 2 years ago
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"☆241Updated last year
- Faster LCM is a script which enables to transfer image styles at 45fps with RTX4090, 33fps with A100.☆95Updated last year
- ☆124Updated 2 weeks ago
- Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'☆255Updated last year
- Diffusion Reinforcement Learning Library☆181Updated last year
- Diffusers Implementation of Controlling Text-to-Image Diffusion by Orthogonal Finetuning☆35Updated last year
- [SIGGRAPH Asia 2023] An interactive story visualization tool that support multiple characters☆261Updated last year
- A diffusers based implementation of HyperDreamBooth☆129Updated last year
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.☆87Updated 2 years ago
- ☆54Updated 2 years ago
- Fast finetuning using a booster model that puts the initial state to a local minimum☆113Updated last year
- A detailed diagram laying out the full Flux.1 architecture as shared by Black Forest Labs at https://github.com/black-forest-labs/flux.☆49Updated 5 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆413Updated last year
- Fork of Controlnet for 2 input channels☆60Updated last year
- Official Implementation of weights2weights☆140Updated 2 weeks ago