An implementation of simple diffusion in PyTorch (and JAX)
☆34Jan 28, 2023Updated 3 years ago
Alternatives and similar repositories for simple-diffusion
Users that are interested in simple-diffusion are comparing it to the libraries listed below
Sorting:
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆14Apr 17, 2023Updated 2 years ago
- JAX implementation ViT-VQGAN☆82Sep 21, 2022Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆13Sep 13, 2024Updated last year
- Unofficial implementation for the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints'[https://arxiv.org/ab…☆12Aug 21, 2022Updated 3 years ago
- ☆14Sep 20, 2023Updated 2 years ago
- Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models☆30Sep 6, 2025Updated 6 months ago
- Object recognition with Pepper using a deep learning model☆10Sep 16, 2021Updated 4 years ago
- visual-text to speech☆14Apr 3, 2022Updated 3 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck☆14Apr 29, 2020Updated 5 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- ☆14Jun 14, 2020Updated 5 years ago
- S$2$CycleDiff: Spatial-Spectral-Bilateral Cycle-Diffusion Frameworkfor Hyperspectral Image Super-Resolution(AAAI 2024)☆15Aug 14, 2025Updated 7 months ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆104Mar 10, 2026Updated last week
- ☆15Jul 29, 2022Updated 3 years ago
- Label images with LabelImg; Object detection with detectron2☆13Aug 20, 2021Updated 4 years ago
- A curated resources on what's happening in multimodal learning. Features recent papers, books, related lectures, and other relevant resou…☆16Apr 28, 2023Updated 2 years ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Apr 11, 2021Updated 4 years ago
- 📃 A curated list of all possible resources (tools, tutorials, platforms, etc) an andrew email can get you☆13Nov 15, 2024Updated last year
- An implementation of 'simple diffusion: End-to-end diffusion for high resolution images' as published by Hoogeboom et al.☆40Feb 9, 2025Updated last year
- ☆26Jun 5, 2024Updated last year
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Mar 29, 2021Updated 4 years ago
- Shuaikai Shi, Lijun Zhang, Jie Chen, "Hyperspectral and Multispectral Image Fusion Using the Conditional Denoising Diffusion Probabilisti…☆14Jul 20, 2023Updated 2 years ago
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆72Jul 24, 2022Updated 3 years ago
- ☆20Oct 3, 2022Updated 3 years ago
- Radar IQ data processing using the GPU☆10Nov 12, 2025Updated 4 months ago
- ☆17Apr 7, 2022Updated 3 years ago
- Code for generating colinraffel.com and my CV☆16Mar 6, 2026Updated 2 weeks ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- ☆13Jun 23, 2022Updated 3 years ago
- The QUT-NOISE database and protocols☆32Nov 13, 2016Updated 9 years ago
- ☆48Aug 21, 2023Updated 2 years ago
- 学成在线 后端 黑马程序员Java企业级实战开发《学成在线》微服务项目,基于SpringCloud、SpringCloudAlibaba技术栈开发,项目搭建到选课支付学习全通关☆16Jan 27, 2023Updated 3 years ago
- A CSRankings-like index for speech researchers☆35Oct 16, 2024Updated last year
- A simple short URL generator for Laravel Framework.☆10Jun 2, 2021Updated 4 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago