patil-suraj/simple-diffusion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/patil-suraj/simple-diffusion)

patil-suraj / simple-diffusion

An implementation of simple diffusion in PyTorch (and JAX)

☆34

Alternatives and similar repositories for simple-diffusion

Users that are interested in simple-diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

craffel / comp664-deep-learning-spring-2023
View on GitHub
Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC
☆14Apr 17, 2023Updated 3 years ago
webaverse / LJSpeechTools
View on GitHub
Tools to isolate speaker and transcribe unstructured audio clips
☆11Dec 4, 2022Updated 3 years ago
patil-suraj / vit-vqgan
View on GitHub
JAX implementation ViT-VQGAN
☆82Sep 21, 2022Updated 3 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
adefossez / audio_mod_idessai
View on GitHub
Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.
☆13Sep 13, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
gzhu06 / Manifold-Constrained-Gradient-ipynb
View on GitHub
Unofficial implementation for the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints'[https://arxiv.org/ab…
☆12Aug 21, 2022Updated 3 years ago
Vaibhavs10 / dcase-2023-workshop
View on GitHub
☆14Sep 20, 2023Updated 2 years ago
kynkaat / guidance-interval
View on GitHub
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models
☆29Sep 6, 2025Updated 10 months ago
Yoshifumi-Nakano / visual-text-to-speech
View on GitHub
visual-text to speech
☆14Apr 3, 2022Updated 4 years ago
softbankrobotics-labs / pepper-deep-learning
View on GitHub
Object recognition with Pepper using a deep learning model
☆10Sep 16, 2021Updated 4 years ago
jinglescode / papers
View on GitHub
Summaries of machine learning papers
☆12Aug 19, 2022Updated 3 years ago
vliu15 / adversarial-tts
View on GitHub
End-to-end Text-to-Speech with Generative Adversarial Networks
☆20Feb 6, 2021Updated 5 years ago
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
auspicious3000 / SpeechSplit-Demo
View on GitHub
Unsupervised Speech Decomposition via Triple Information Bottleneck
☆14Apr 29, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lifeiteng / VoiceBox
View on GitHub
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆29Aug 4, 2023Updated 2 years ago
gnhdnb / adjustable-real-time-style-transfer
View on GitHub
☆14Jun 14, 2020Updated 6 years ago
melkor169 / CP_Drums_Generation
View on GitHub
☆15Jul 29, 2022Updated 3 years ago
shuaikaishi / DDPMFus
View on GitHub
Shuaikai Shi, Lijun Zhang, Jie Chen, "Hyperspectral and Multispectral Image Fusion Using the Conditional Denoising Diffusion Probabilisti…
☆14Jul 20, 2023Updated 3 years ago
bshall / acoustic-model
View on GitHub
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
☆104Mar 10, 2026Updated 4 months ago
kvsnoufal / dashcam-object-detection
View on GitHub
Label images with LabelImg; Object detection with detectron2
☆13Aug 20, 2021Updated 4 years ago
xinshengwang / ICASSP2021_paper_list-VC
View on GitHub
ICASSP 2021 accepted papers in term of voice conversion (VC)
☆18Apr 11, 2021Updated 5 years ago
gkioxari / aims2022
View on GitHub
☆17Apr 7, 2022Updated 4 years ago
rosinality / instant-ngp-pytorch
View on GitHub
Study for Instant neural graphics primitives (Unofficial)
☆11Jan 18, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bastibe / MAPS-Scripts
View on GitHub
A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.
☆25Mar 29, 2021Updated 5 years ago
Dapwner / CVAE-Tacotron
View on GitHub
☆26Jun 5, 2024Updated 2 years ago
SarthakYadav / audax
View on GitHub
A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.
☆72Jul 24, 2022Updated 3 years ago
LukeWood / reef-net
View on GitHub
☆24Sep 2, 2022Updated 3 years ago
rcjackson / HighIQ
View on GitHub
Radar IQ data processing using the GPU
☆11Nov 12, 2025Updated 8 months ago
craffel / craffel.github.io
View on GitHub
Code for generating colinraffel.com and my CV
☆16Updated this week
pprablanc / ppsrt
View on GitHub
A python algorithm to change the pitch of the voice in real time
☆13Dec 13, 2020Updated 5 years ago
plassma / symbolic-music-discrete-diffusion
View on GitHub
☆50Aug 21, 2023Updated 2 years ago
crux82 / msr-vtt-it
View on GitHub
A large scale dataset for Video Captioning in Italian
☆13May 16, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lamblin / awesome-deep-learning-papers
View on GitHub
☆16Jun 4, 2016Updated 10 years ago
qutsaivt / QUT-NOISE
View on GitHub
The QUT-NOISE database and protocols
☆32Nov 13, 2016Updated 9 years ago
Aria-K-Alethia / laughter-synthesis
View on GitHub
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77Jul 16, 2023Updated 3 years ago
lucidrains / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆15May 18, 2021Updated 5 years ago
hearbenchmark / hear-eval-kit
View on GitHub
Evaluation kit for the HEAR Benchmark
☆65Feb 12, 2026Updated 5 months ago
mutiann / speech_rankings
View on GitHub
A CSRankings-like index for speech researchers
☆35Oct 16, 2024Updated last year
Infinity-INF / fast-phasr
View on GitHub
Phonemes and durations labeling based on whisper small
☆11Jul 7, 2024Updated 2 years ago