AgentCooper2002/EDMSound

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AgentCooper2002/EDMSound)

AgentCooper2002 / EDMSound

Codebase and project page for EDMSound

☆35

Alternatives and similar repositories for EDMSound

Users that are interested in EDMSound are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lifeiteng / VoiceBox
View on GitHub
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆29Aug 4, 2023Updated 2 years ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
vtuber-plan / FlowVAE
View on GitHub
☆17Dec 12, 2023Updated 2 years ago
gzhu06 / Cacophony
View on GitHub
Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986
☆49Jan 19, 2026Updated 6 months ago
cpdu / unicats
View on GitHub
☆63Jan 15, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
RF5 / simple-asgan
View on GitHub
Training code and trained checkpoints for ASGAN.
☆62Dec 27, 2023Updated 2 years ago
happylittlecat2333 / Auffusion
View on GitHub
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…
☆194Mar 25, 2024Updated 2 years ago
reppy4620 / vocoders
View on GitHub
My vocoder experiments
☆31Jul 26, 2025Updated 11 months ago
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
X-LANCE / UniCATS-CTX-vec2wav
View on GitHub
[AAAI 2024] Code for CTX-vec2wav in UniCATS
☆130Jun 11, 2024Updated 2 years ago
haiciyang / LaDiffCodec
View on GitHub
ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.
☆56Nov 16, 2025Updated 8 months ago
plassma / symbolic-music-discrete-diffusion
View on GitHub
☆50Aug 21, 2023Updated 2 years ago
P1ping / TokAN-Legacy
View on GitHub
☆27Jun 22, 2026Updated 3 weeks ago
BrightGu / MediumVC
View on GitHub
Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
☆55Oct 11, 2021Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Mddct / usm-tokenizer
View on GitHub
semantic tokenizer for speech and music
☆20Jul 6, 2025Updated last year
MWM-io / nansypp
View on GitHub
Unofficial implementation of NANSY++ in Pytorch Lightning
☆50Mar 11, 2024Updated 2 years ago
adefossez / audio_mod_idessai
View on GitHub
Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.
☆13Sep 13, 2024Updated last year
jmoso13 / jukebox-diffusion
View on GitHub
☆107Oct 16, 2023Updated 2 years ago
chloelavrat / torch-ddsp
View on GitHub
A real time implementation of the ddsp from google magenta.
☆16Nov 8, 2021Updated 4 years ago
carlthome / pmqd
View on GitHub
Perceived Music Quality Dataset
☆12Jul 1, 2024Updated 2 years ago
dzluke / DAFX2024
View on GitHub
Code for paper "Network Bending of Diffusion Models for Audio-Visual Generation" at DAFx 2024
☆17Aug 26, 2025Updated 10 months ago
yangdongchao / LLM-Codec
View on GitHub
The open source code for LLM-Codec
☆147Aug 18, 2024Updated last year
ryota-komatsu / speech_resynth
View on GitHub
Speech Resynthesis and Language Modeling
☆27Jun 11, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
mbrotos / SoundSeg
View on GitHub
Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation
☆13Feb 18, 2026Updated 5 months ago
jamesparsloe / llm.speech
View on GitHub
Trying to build an all in one speech-text language model - a bit like GPT-4o
☆22Jun 1, 2024Updated 2 years ago
aask1357 / hilcodec
View on GitHub
High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec
☆120Jun 23, 2025Updated last year
MiscellaneousStuff / PhoneLM
View on GitHub
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48Sep 4, 2023Updated 2 years ago
yanghaha0908 / FastHuBERT
View on GitHub
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆100Nov 20, 2024Updated last year
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
nkandpa2 / music_enhancement
View on GitHub
Implementation for "Music Enhancement via Image Translation and Vocoding"
☆54Apr 28, 2022Updated 4 years ago
bytedance / Make-An-Audio-2
View on GitHub
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
☆197May 29, 2024Updated 2 years ago
kaist-ami / SoundBrush
View on GitHub
☆14Dec 8, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hanshounsu / d3rm
View on GitHub
☆14Feb 3, 2026Updated 5 months ago
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
CODEJIN / DiffSingerKR
View on GitHub
☆25Aug 31, 2024Updated last year
WangHelin1997 / SSR-Speech
View on GitHub
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis
☆154Jan 1, 2025Updated last year
sony / soundctm
View on GitHub
Pytorch implementation of SoundCTM
☆101Mar 31, 2025Updated last year
X-LANCE / UniCATS-CTX-txt2vec
View on GitHub
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
☆64Nov 18, 2024Updated last year
yangdongchao / ALMTokenizer2
View on GitHub
The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…
☆45Sep 5, 2025Updated 10 months ago