Birch-san / imagebind-guided-diffusionLinks
Guide diffusion on ImageBind embedding similarity
☆29Updated 2 years ago
Alternatives and similar repositories for imagebind-guided-diffusion
Users that are interested in imagebind-guided-diffusion are comparing it to the libraries listed below
Sorting:
- ☆27Updated last year
- Generate images from an initial frame and text☆37Updated last year
- ☆24Updated 2 years ago
- ☆23Updated last year
- Animatediff implementation. Includes a ControlNet pipeline.☆19Updated last year
- ☆33Updated 7 months ago
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 2 years ago
- A JAX implementation of the continuous time formulation of Consistency Models☆85Updated 2 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 2 years ago
- ☆24Updated last year
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆39Updated 2 years ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 3 years ago
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆24Updated last year
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior.☆35Updated 2 years ago
- ☆39Updated last year
- ☆28Updated 10 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆35Updated 11 months ago
- ☆73Updated 2 years ago
- The implementation for Accelerating Guided Diffusion Sampling with Splitting Numerical Methods (2023)☆48Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated last year
- Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress)☆36Updated 3 years ago
- JAX implementation ViT-VQGAN☆83Updated 2 years ago
- WIP Pytorch code for stably training single-step, mode-dropping, deterministic autoencoders☆29Updated last week
- ☆16Updated last year
- ☆26Updated last year
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Updated 3 years ago
- Official implementation of "Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics" (NeurIPS 2023)☆37Updated last year
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 4 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆69Updated 6 months ago
- [NeurIPS 2022: Score-Based Modeling Workshop] Multiresolution Textual Inversion☆99Updated 2 years ago