patil-suraj / simple-diffusion
An implementation of simple diffusion in PyTorch (and JAX)
☆35Updated 2 years ago
Alternatives and similar repositories for simple-diffusion:
Users that are interested in simple-diffusion are comparing it to the libraries listed below
- JAX implementation ViT-VQGAN☆80Updated 2 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 2 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆88Updated 2 years ago
- Fast Inference in Denoising Diffusion Models via MMD Finetuning☆16Updated last year
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 2 years ago
- [NeurIPS 2022: Score-Based Modeling Workshop] Multiresolution Textual Inversion☆99Updated 2 years ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆38Updated 3 months ago
- Unofficial implementation of Neural Analysis and Synthesis☆7Updated 3 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆79Updated 7 months ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆16Updated 6 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆83Updated 4 months ago
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Updated 3 years ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆20Updated 3 years ago
- ☆16Updated 3 years ago
- Implementation of Taming Transformers for High-Resolution Image Synthesis (https://arxiv.org/abs/2012.09841) in PyTorch☆16Updated 4 years ago
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior.☆35Updated 2 years ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆39Updated last year
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆99Updated 2 years ago
- Guide diffusion on ImageBind embedding similarity☆28Updated last year
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆36Updated 10 months ago
- ☆28Updated 3 years ago
- Contrastive Language-Audio Pretraining☆15Updated 3 years ago
- Pedagogical codebase for a simplified score-based generative model design, with training loop☆39Updated 3 years ago
- ☆21Updated 3 weeks ago
- A convolution-free, transformer-only version of the CycleGAN framework☆33Updated 3 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆111Updated 2 years ago
- PyTorch implementation of Contrastive Feature Loss for Image Prediction (AIM Workshop at ICCV 2021)☆53Updated 3 years ago
- [ICLR 2024] Official code for the paper 'Elucidating the Exposure Bias in Diffusion Models'☆40Updated last week
- The implementation for Accelerating Guided Diffusion Sampling with Splitting Numerical Methods (2023)☆48Updated last year
- Official PyTorch implementation for FastDPM, a fast sampling algorithm for diffusion probabilistic models☆82Updated 3 years ago