sayakpaul / caption-upsamplingLinks
This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
☆151Updated last year
Alternatives and similar repositories for caption-upsampling
Users that are interested in caption-upsampling are comparing it to the libraries listed below
Sorting:
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆175Updated last year
- ☆1Updated 3 months ago
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models☆173Updated last year
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"☆241Updated last year
- ☆319Updated 8 months ago
- ☆116Updated 2 years ago
- ☆51Updated 2 years ago
- A diffusers based implementation of HyperDreamBooth☆132Updated last year
- ☆128Updated 7 months ago
- Diffusion Reinforcement Learning Library☆185Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆128Updated last year
- ☆431Updated last year
- Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'☆254Updated last year
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆85Updated last year
- ☆54Updated 2 years ago
- This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style.☆217Updated last year
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆538Updated last year
- ☆184Updated last year
- [inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆13Updated last year
- ☆60Updated last year
- Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"☆350Updated last year
- Improved AnimateDiff with a number of improvements☆39Updated last year
- Code for instruction-tuning Stable Diffusion.☆232Updated last year
- Forked version of AnimateDiff, attempts to add init images. If you are look into original repo, please go to https://github.com/guoyww/a…☆150Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)☆167Updated last month
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆423Updated last year
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆235Updated last year
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.☆64Updated last year
- ☆72Updated 2 years ago
- Official Implementation for "ConceptLab: Creative Generation using Diffusion Prior Constraints"☆252Updated last year