Birch-san / imagebind-guided-diffusion
Guide diffusion on ImageBind embedding similarity
☆28 Updated last year
Related projects
Alternatives and complementary repositories for imagebind-guided-diffusion
- AnimateDiff implementation. Includes a ControlNet pipeline. ☆19 Updated 10 months ago
- Generate images from an initial frame and text ☆37 Updated last year
- ☆71 Updated last year
- Let's try and finetune the OpenAI consistency decoder to work for SDXL ☆23 Updated 11 months ago
- ☆28 Updated 2 weeks ago
- ☆26 Updated 5 months ago
- A JAX implementation of the continuous time formulation of Consistency Models ☆83 Updated last year
- ☆21 Updated 5 months ago
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior. ☆35 Updated 2 years ago
- ☆26 Updated 6 months ago
- ☆27 Updated 3 months ago
- ☆40 Updated this week
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP ☆39 Updated last year
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested. ☆55 Updated 9 months ago
- ☆24 Updated 5 months ago
- [NeurIPS 2022: Score-Based Modeling Workshop] Multiresolution Textual Inversion ☆98 Updated last year
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified … ☆64 Updated last week
- Official repository for the paper "VQDM: Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization" ☆29 Updated 2 months ago
- A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling. ☆30 Updated 4 months ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa. ☆59 Updated 2 years ago
- ☆24 Updated last year
- 🎨 Fill in masked parts of images with FLUX.1-dev 🖌️ ☆27 Updated 3 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder ☆48 Updated this week
- Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation" ☆34 Updated last week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io ☆16 Updated 7 months ago
- ☆33 Updated 6 months ago