sayakpaul / caption-upsamplingLinks
This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
☆158Updated 2 years ago
Alternatives and similar repositories for caption-upsampling
Users that are interested in caption-upsampling are comparing it to the libraries listed below
Sorting:
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆179Updated last year
- ☆321Updated last year
- ☆61Updated last year
- ☆128Updated 2 months ago
- ☆442Updated last year
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"☆244Updated last year
- ☆86Updated 2 years ago
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models☆175Updated 2 years ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆86Updated 2 years ago
- Faster generation with text-to-image diffusion models.☆231Updated 6 months ago
- ☆126Updated 9 months ago
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆237Updated 2 years ago
- Diffusion Reinforcement Learning Library☆192Updated last year
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.☆65Updated last year
- ☆50Updated 2 years ago
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆558Updated 2 years ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆434Updated 2 years ago
- ☆73Updated 2 years ago
- Tiny optimized Stable-diffusion that can run on GPUs with just 1GB of VRAM. (Beta)☆180Updated 2 years ago
- ☆118Updated 3 years ago
- [inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆13Updated last year
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.☆87Updated 3 years ago
- Mixture of Diffusers for scene composition and high resolution image generation☆447Updated 2 years ago
- Official Implementation for "ConceptLab: Creative Generation using Diffusion Prior Constraints"☆254Updated 2 years ago
- Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"☆348Updated last year
- Apply controlnet to video clips☆82Updated last year
- Merge safetensor files using the technique described in "Language Models are Super Mario: Absorbing Abilities from Homologous Models as a…☆82Updated last year
- Improved AnimateDiff with a number of improvements☆41Updated last year
- Code for instruction-tuning Stable Diffusion.☆247Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆82Updated last year