Harmonai-org/audio-diffusion-pytorch-fork

Audio generation using diffusion models, in PyTorch.

☆49

Alternatives and similar repositories for audio-diffusion-pytorch-fork

Users interested in audio-diffusion-pytorch-fork are comparing it to the repositories listed below.

Harmonai-org / oobleck
open soundstream-ish VAE codecs for downstream neural audio synthesis
☆124 · Jun 12, 2023 · Updated 3 years ago
diontimmer / sample-diffusion-gui
GUI toolkit using various audio diffusion repos.
☆76 · Jul 27, 2023 · Updated 2 years ago
maxrmorrison / promonet
Prosody and Pronunciation Modification Network
☆64 · May 5, 2025 · Updated last year
ldzhangyx / MusicMagus
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
☆49 · Sep 11, 2024 · Updated last year
gitmylo / bark-data-gen
Create training data for training a voice cloner for bark text to speech.
☆47 · Jun 13, 2023 · Updated 3 years ago
mll-lab-nu / ENACT
ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an…
☆52 · Nov 27, 2025 · Updated 7 months ago
genekogan / youtube-summarizer
yt_dlp -> whisper -> gpt4
☆17 · Jan 8, 2024 · Updated 2 years ago
archinetai / archisound
A collection of pre-trained audio models, in PyTorch.
☆116 · Jan 27, 2023 · Updated 3 years ago
gudgud96 / noisy-student-emotion-training
Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging
☆11 · Dec 2, 2021 · Updated 4 years ago
Harmonai-org / sample-generator
Tools to train a generative model on arbitrary audio samples
☆1,116 · Apr 29, 2024 · Updated 2 years ago
zhengmidon / singaligner
a compact audio-to-phoneme aligner for singing voice
☆12 · Jan 17, 2024 · Updated 2 years ago
sanderwood / melodyt5
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]
☆49 · Jan 23, 2025 · Updated last year
sony / FxNorm-automix
FxNorm-Automix - Implementation of automatic music mixing systems. We show how we can use wet music data and repurpose it to train a full…
☆145 · Mar 11, 2024 · Updated 2 years ago
asigalov61 / Yoda
[DEPRECATED] Morpheus Music AI implementation spin-off :)
☆16 · Oct 5, 2022 · Updated 3 years ago
sakemin / cog-musicgen-fine-tuner
This is a cog implementation of the fine-tuner for Meta's MusicGen
☆55 · Apr 5, 2024 · Updated 2 years ago
ETH-DISCO / cue-detr
Repository of the ISMIR'24 paper "Cue Point Estimation using Object Detection"
☆30 · Aug 19, 2024 · Updated last year
csteinmetz1 / AutomaticMixingPapers
Important papers and associated code on automatic mixing research
☆110 · Jun 14, 2026 · Updated 3 weeks ago
EGiunchiglia / CCN
Code for paper "Multi-label Classification Neural Networks with Hard Logical Constraints"
☆15 · Sep 6, 2022 · Updated 3 years ago
yukara-ikemiya / minimal-sqvae
A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony
☆33 · Oct 16, 2023 · Updated 2 years ago
yangdongchao / ALMTokenizer
The demo page for ALMTokenizer
☆59 · Apr 14, 2025 · Updated last year
Nasdacoin / Nasdacoin
☆11 · Nov 10, 2019 · Updated 6 years ago
haoheliu / SemantiCodec
☆45 · Jun 11, 2024 · Updated 2 years ago
LonicaMewinsky / ComfyUI-MakeFrame
Custom node for breaking an animation into frames (and keyframes)
☆30 · May 22, 2024 · Updated 2 years ago
archinetai / audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
☆2,098 · Jun 12, 2023 · Updated 3 years ago
dl4am / tutorial
Deep learning for automatic mixing
☆32 · Aug 29, 2024 · Updated last year
lilymoonight / stock_tax_calculator
☆39 · Feb 21, 2026 · Updated 4 months ago
CantorDigitalis / CantorDigitalis2.1
Cantor Digitalis - version 2.1
☆13 · Jan 28, 2024 · Updated 2 years ago
ricardokleinklein / deepMultiSpeech
Deep Multi-Speech model
☆11 · Jul 25, 2018 · Updated 7 years ago
iZotope / max_vst_renderer
☆13 · Mar 26, 2024 · Updated 2 years ago
neuml / ttstokenizer
Tokenizer for Text to Speech (TTS) models
☆14 · Jan 16, 2025 · Updated last year
PeiChunChang / MS-SincResNet
This paper has been accepted in ACM ICMR 2021.
☆20 · Nov 17, 2025 · Updated 7 months ago
CognitiveCodes / NeuralGPT
Personalized all-purpose AI assistance platform based on hierarchical cooperative multi-agent framework which utilizes websocket connecti…
☆38 · Aug 11, 2024 · Updated last year
X-LANCE / UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
☆130 · Jun 11, 2024 · Updated 2 years ago
X-LANCE / UniCATS-CTX-txt2vec
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
☆64 · Nov 18, 2024 · Updated last year
sizhelee / Diff-BGM
official code for CVPR'24 paper Diff-BGM
☆71 · Oct 12, 2024 · Updated last year
junjun3518 / alias-free-torch
Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample
☆101 · Jul 26, 2022 · Updated 3 years ago
suncerock / EAsT-music-classification
Audio Embeddings as Teachers for Music Classification
☆13 · Sep 7, 2023 · Updated 2 years ago
gladia-research-group / multi-source-diffusion-models
☆171 · Aug 14, 2023 · Updated 2 years ago
streichgeorg / autosing
☆18 · Jan 20, 2025 · Updated last year