sayakpaul / caption-upsampling
This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
☆149Updated 10 months ago
Related projects: ⓘ
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆164Updated 5 months ago
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models☆160Updated last year
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆84Updated 8 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆172Updated this week
- ☆118Updated 2 weeks ago
- IP Adapter Instruct☆175Updated last month
- ☆117Updated 3 months ago
- ☆314Updated last week
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"☆236Updated 5 months ago
- Official Implementation of weights2weights☆98Updated last week
- An AI focused photo manipulation tool based on Gradio☆126Updated this week
- Diffusion Reinforcement Learning Library☆171Updated 7 months ago
- ☆398Updated 5 months ago
- Code repository for T2V-Turbo☆166Updated 2 months ago
- Faster generation with text-to-image diffusion models.☆181Updated 4 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆90Updated 2 months ago
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆311Updated last week
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆502Updated 8 months ago
- ☆0Updated 9 months ago
- Fine-tuning code for CLIP models☆120Updated this week
- This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style.☆203Updated last year
- ☆52Updated last year
- Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'☆252Updated 11 months ago
- Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"☆338Updated 6 months ago
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆180Updated 2 months ago
- ☆188Updated 8 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆70Updated last month
- Implicit Style-Content Separation using B-LoRA☆282Updated 3 months ago
- ☆85Updated last year
- [SIGGRAPH Asia 2023] An interactive story visualization tool that support multiple characters☆251Updated 5 months ago