HilaManor/AudioEditingCode

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HilaManor/AudioEditingCode)

HilaManor / AudioEditingCode

☆187

Alternatives and similar repositories for AudioEditingCode

Users that are interested in AudioEditingCode are comparing it to the libraries listed below

Sorting:

happylittlecat2333 / Auffusion
View on GitHub
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…
☆192Mar 25, 2024Updated last year
sh-lee-prml / PeriodWave
View on GitHub
The official Implementation of PeriodWave and PeriodWave-Turbo
☆219Apr 14, 2025Updated 10 months ago
jaeyeonkim99 / EnCLAP
View on GitHub
Official Implementation of EnCLAP (ICASSP 2024)
☆94Jun 2, 2024Updated last year
i-need-sleep / mad
View on GitHub
☆16Sep 29, 2025Updated 5 months ago
glory20h / VoiceLDM
View on GitHub
VoiceLDM: Text-to-Speech with Environmental Context
☆191Aug 9, 2024Updated last year
gwh22 / LAFMA
View on GitHub
LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)
☆43Jun 13, 2024Updated last year
Sreyan88 / CompA
View on GitHub
Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
☆22Jul 10, 2024Updated last year
asuni / PitchSqueezer
View on GitHub
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆36Jan 17, 2024Updated 2 years ago
Audio-AGI / WavJourney
View on GitHub
WavJourney: Compositional Audio Creation with LLMs
☆540Sep 28, 2023Updated 2 years ago
sony / soundctm
View on GitHub
Pytorch implementation of SoundCTM
☆100Mar 31, 2025Updated 11 months ago
LiuZH-19 / SongGen
View on GitHub
[ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
☆304Nov 5, 2025Updated 3 months ago
AMAAI-Lab / mustango
View on GitHub
Mustango: Toward Controllable Text-to-Music Generation
☆386Jun 2, 2025Updated 8 months ago
XiangLi2022 / CM-TTS
View on GitHub
[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a…
☆69Mar 31, 2024Updated last year
sony / diffusion-timbre-transfer
View on GitHub
☆55Nov 5, 2024Updated last year
jishengpeng / TextrolSpeech
View on GitHub
[ICASSP 2024] TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models
☆183Nov 22, 2024Updated last year
EmilianPostolache / stable-audio-controlnet
View on GitHub
Fine-tune Stable Audio Open with DiT ControlNet.
☆249May 16, 2025Updated 9 months ago
zhenye234 / FlashSpeech
View on GitHub
ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis
☆154Sep 20, 2024Updated last year
NKU-HLT / AudioEditor
View on GitHub
☆40Apr 2, 2025Updated 10 months ago
luosiallen / Diff-Foley
View on GitHub
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
☆200May 29, 2024Updated last year
X-E-Speech / X-E-Speech-code
View on GitHub
X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion
☆111Apr 1, 2024Updated last year
zelaki / DreamSound
View on GitHub
[ICASSP'24] Investigating Personalization Methods in Text to Music Generation
☆45Mar 27, 2024Updated last year
yangdongchao / UniAudio
View on GitHub
The Open Source Code of UniAudio
☆604Jul 22, 2024Updated last year
jhtonyKoo / music_mixing_style_transfer
View on GitHub
☆180Oct 24, 2023Updated 2 years ago
thuhcsi / LightGrad
View on GitHub
☆68Jul 23, 2023Updated 2 years ago
seastar105 / pflow-encodec
View on GitHub
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
☆77May 12, 2024Updated last year
luotianze666 / WaveFM
View on GitHub
[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
☆121Mar 27, 2025Updated 11 months ago
uthree / ddsp-vocoder
View on GitHub
☆11Nov 7, 2024Updated last year
yoongi43 / music_audio_enhancement_conformer
View on GitHub
Implementation of the paper "Exploiting Time-Frequency Conformers for Music Audio Enhancement"
☆12Mar 21, 2025Updated 11 months ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated last year
seungheondoh / lp-music-caps
View on GitHub
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆344Apr 8, 2024Updated last year
zhenye234 / CoMoSpeech
View on GitHub
ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
☆213Apr 26, 2024Updated last year
haidog-yaqub / EzAudio
View on GitHub
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
☆329Dec 17, 2025Updated 2 months ago
ldzhangyx / MusicMagus
View on GitHub
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
☆48Sep 11, 2024Updated last year
shansongliu / MU-LLaMA
View on GitHub
MU-LLaMA: Music Understanding Large Language Model
☆303Aug 18, 2025Updated 6 months ago
mct10 / RepCodec
View on GitHub
Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization
☆192Jul 12, 2024Updated last year
Grace9994 / CoMoSVC
View on GitHub
CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone
☆147Mar 23, 2024Updated last year
bytedance / Make-An-Audio-2
View on GitHub
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
☆186May 29, 2024Updated last year
X-LANCE / StoryTTS
View on GitHub
[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
☆142Apr 27, 2024Updated last year
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 2 years ago