shahules786 / mayavoz
View external linksLinks

Pytorch based speech enhancement toolkit.

☆336

Alternatives and similar repositories for mayavoz

Users that are interested in mayavoz are comparing it to the libraries listed below

Sorting:

sushant-t / tts-trainer
View on GitHub
Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…
☆30May 27, 2023Updated 2 years ago
shang0712 / HierTTS
View on GitHub
☆46Apr 16, 2023Updated 2 years ago
zkx06111 / WSRGlow
View on GitHub
The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.
☆127Sep 7, 2021Updated 4 years ago
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
View on GitHub
Incorporating AutoVocoder to MB-iSTFT-VITS
☆48Dec 1, 2022Updated 3 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
yl4579 / PitchExtractor
View on GitHub
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
☆146Aug 22, 2022Updated 3 years ago
maum-ai / phaseaug
View on GitHub
ICASSP 2023 Accepted
☆189May 6, 2024Updated last year
Stylish-TTS / stylish-tts
View on GitHub
High quality text-to-speech based on StyleTTS 2.
☆72Updated this week
brentspell / torch-yin
View on GitHub
Yin pitch estimator in PyTorch
☆117Nov 7, 2022Updated 3 years ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
sp-nitech / diffsptk
View on GitHub
A differentiable version of SPTK
☆192Feb 3, 2026Updated last week
ruizhecao96 / CMGAN
View on GitHub
Conformer-based Metric GAN for speech enhancement
☆412May 3, 2024Updated last year
hs-oh-prml / DurFlexEVC
View on GitHub
☆82Jan 22, 2025Updated last year
lakahaga / dc-comix-tts
View on GitHub
Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer
☆75Aug 21, 2023Updated 2 years ago
MiniXC / LightningFastSpeech2
View on GitHub
☆55Jan 13, 2023Updated 3 years ago
brentspell / hifi-gan-bwe
View on GitHub
Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.
☆223Oct 20, 2023Updated 2 years ago
ssi-research / FQSE
View on GitHub
Fully Quantized Neural Networks For Speech Enhancement
☆63Feb 15, 2024Updated 2 years ago
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
vliu15 / adversarial-tts
View on GitHub
End-to-end Text-to-Speech with Generative Adversarial Networks
☆20Feb 6, 2021Updated 5 years ago
yxlu-0102 / MP-SENet
View on GitHub
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
☆469May 19, 2025Updated 8 months ago
vtuber-plan / hifi-gan
View on GitHub
An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.
☆32Apr 10, 2023Updated 2 years ago
YoungSeng / SRD-VC
View on GitHub
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆119Feb 7, 2024Updated 2 years ago
schmiph2 / pysepm
View on GitHub
Python implementation of performance metrics in Loizou's Speech Enhancement book
☆447Feb 15, 2025Updated last year
ishine / Project_sp_ehance_matlab
View on GitHub
☆12Jun 17, 2019Updated 6 years ago
YangAi520 / NSPP
View on GitHub
☆54Mar 2, 2023Updated 2 years ago
Rongjiehuang / GenerSpeech
View on GitHub
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
☆329Feb 9, 2024Updated 2 years ago
unilight / s3prl-vc
View on GitHub
S3PRL-VC: A Voice Conversion Toolkit based on S3PRL
☆101Jun 26, 2024Updated last year
X-LANCE / VoiceFlow-TTS
View on GitHub
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
☆366Sep 3, 2024Updated last year
CODEJIN / VITS_Diffusion
View on GitHub
☆26Sep 22, 2022Updated 3 years ago
audiolabs / torch-pesq
View on GitHub
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
☆222Jul 14, 2023Updated 2 years ago
Audio-WestlakeU / FullSubNet
View on GitHub
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
☆596Aug 19, 2023Updated 2 years ago
ex3ndr / supervoice-enhance
View on GitHub
Supervoice diffusion enhance
☆28Jul 15, 2024Updated last year
naver-ai / RapFlow-TTS
View on GitHub
☆52Jul 16, 2025Updated 6 months ago
CODEJIN / NaturalSpeech2
View on GitHub
☆140Jan 7, 2024Updated 2 years ago
IMLHF / Speech-Enhancement-Measures
View on GitHub
speech enhancement metrics：CSIG, CBAK, CMOS, SSNR, PESQ, STOI, ESTOI, SNR, IS, LLR, WSS
☆71Jul 21, 2023Updated 2 years ago
RookieJunChen / FullSubNet-plus
View on GitHub
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
☆288Jul 26, 2025Updated 6 months ago
will-rice / denoisers
View on GitHub
Simple PyTorch Denoisers for Waveform Audio
☆40Dec 23, 2025Updated last month
WangHelin1997 / Fast-GeCo
View on GitHub
Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction
☆46Nov 19, 2024Updated last year
seorim0 / DCCRN-with-various-loss-functions
View on GitHub
DCCRN with various loss functions
☆103Sep 29, 2022Updated 3 years ago

shahules786 / mayavozView external linksLinks

Alternatives and similar repositories for mayavoz

shahules786 / mayavoz
View external linksLinks