patil-suraj / simple-diffusion
An implementation of simple diffusion in PyTorch (and JAX)
☆35Updated 2 years ago
Alternatives and similar repositories for simple-diffusion:
Users that are interested in simple-diffusion are comparing it to the libraries listed below
- JAX implementation ViT-VQGAN☆83Updated 2 years ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 3 years ago
- [NeurIPS 2022: Score-Based Modeling Workshop] Multiresolution Textual Inversion☆99Updated 2 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆90Updated 3 years ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆20Updated 4 years ago
- Unofficial implementation of Neural Analysis and Synthesis☆7Updated 3 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 2 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆81Updated 10 months ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆18Updated 9 months ago
- ☆16Updated 3 years ago
- Fast Inference in Denoising Diffusion Models via MMD Finetuning☆17Updated last year
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆35Updated 10 months ago
- ☆45Updated last year
- The implementation for Accelerating Guided Diffusion Sampling with Splitting Numerical Methods (2023)☆48Updated 2 years ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30Updated 2 years ago
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"☆56Updated 2 years ago
- [ECCV 2022 & IJCV 2025] Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling☆57Updated last month
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆115Updated 2 years ago
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior.☆35Updated 2 years ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆106Updated 3 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆86Updated 6 months ago
- Guide diffusion on ImageBind embedding similarity☆28Updated last year
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆32Updated last year
- Contrastive Language-Audio Pretraining☆15Updated 3 years ago
- The official PyTorch implementation of Fast Diffusion Model☆95Updated last year
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆36Updated last year
- ☆28Updated 3 years ago
- Official implemention for Diffusion Models Are Innate One-Step Generators☆22Updated last month
- ☆21Updated last year
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Updated 3 years ago