Birch-san / imagebind-guided-diffusionLinks
Guide diffusion on ImageBind embedding similarity
☆29Updated 2 years ago
Alternatives and similar repositories for imagebind-guided-diffusion
Users that are interested in imagebind-guided-diffusion are comparing it to the libraries listed below
Sorting:
- Generate images from an initial frame and text☆36Updated last year
- ☆24Updated 2 years ago
- Animatediff implementation. Includes a ControlNet pipeline.☆18Updated last year
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆24Updated last year
- ☆27Updated last year
- ☆33Updated 7 months ago
- ☆23Updated 11 months ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 3 years ago
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆38Updated 2 years ago
- ☆28Updated 10 months ago
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior.☆35Updated 2 years ago
- ☆72Updated 2 years ago
- A JAX implementation of the continuous time formulation of Consistency Models☆84Updated 2 years ago
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 2 years ago
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch☆50Updated 2 years ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Updated 3 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆15Updated last year
- Fine-tune of Florence-2 for shot categorization.☆24Updated 2 months ago
- ☆26Updated 11 months ago
- ☆16Updated last year
- [NeurIPS 2022: Score-Based Modeling Workshop] Multiresolution Textual Inversion☆98Updated 2 years ago
- WIP Pytorch code for stably training single-step, mode-dropping, deterministic autoencoders☆27Updated last year
- A fast approach for translating a series of text prompts into a video. The 2022 NeurIPS Workshop on Machine Learning for Creativity and D…☆32Updated last year
- Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress)☆36Updated 3 years ago
- ☆24Updated 11 months ago
- Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"☆38Updated 3 weeks ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆35Updated 11 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆70Updated 6 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 4 months ago