dmse4tts/DMSE4TTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dmse4tts/DMSE4TTS)

dmse4tts / DMSE4TTS

☆24

Alternatives and similar repositories for DMSE4TTS

Users that are interested in DMSE4TTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yxlllc / vocal-remover
View on GitHub
Vocal Remover using Deep Neural Networks
☆21Dec 31, 2024Updated last year
zds-potato / multilingual-phonetic-sv
View on GitHub
☆10Dec 22, 2023Updated 2 years ago
mushanshanshan / ESLTTS
View on GitHub
ESLTTS dataset
☆16Feb 6, 2025Updated last year
nikvaessen / w2v2-speaker-few-samples
View on GitHub
Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688
☆13Dec 2, 2024Updated last year
ffxiong / stsubnet
View on GitHub
☆22Oct 17, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
uthree / fastersvc
View on GitHub
☆26Mar 20, 2024Updated 2 years ago
aparnadutta / code-mixed-lid
View on GitHub
Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.
☆10Aug 13, 2023Updated 2 years ago
thuhcsi / english-conversation-corpus
View on GitHub
English conversation corpus for conversational TTS.
☆21Mar 13, 2023Updated 3 years ago
ludlows / snreval
View on GitHub
Objective measures of speech quality SNR
☆20Aug 1, 2019Updated 6 years ago
MiukkaZh / MGT
View on GitHub
Learning Domain-Invariant Transformation for Speaker Verification.
☆11Jun 13, 2023Updated 3 years ago
bsxfan / PSDA
View on GitHub
Probabilistic Spherical Discriminant Analysis
☆12Oct 29, 2022Updated 3 years ago
dangf15 / THLNet
View on GitHub
☆38Jun 5, 2023Updated 3 years ago
line / WaveTrainerFit
View on GitHub
Official implementation of "Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech G…
☆16Feb 6, 2026Updated 5 months ago
anas-rz / specmix-pytorch
View on GitHub
A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features
☆10Oct 5, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
kaistmm / AdaptVC
View on GitHub
☆17Jun 2, 2025Updated last year
jonashaag / speech-enhancement
View on GitHub
Collection of papers, datasets and tools on the topic of Speech Dereverberation and Speech Enhancement
☆25Jan 23, 2022Updated 4 years ago
chenchy / D3Net
View on GitHub
A pytorch implementation of D3Net.
☆11Aug 8, 2021Updated 4 years ago
bfs18 / rfwave
View on GitHub
☆152Apr 25, 2025Updated last year
xinan-chen / AP_BWE
View on GitHub
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
☆13Jul 22, 2024Updated 2 years ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
idiap / knn-tts
View on GitHub
Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model
☆36Apr 29, 2025Updated last year
Audio-AGI / dcase2024_task9_baseline
View on GitHub
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26Mar 27, 2024Updated 2 years ago
zhenye234 / CoMoSpeech
View on GitHub
ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
☆214Apr 26, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Maitreyapatel / speech-conversion-between-different-modalities
View on GitHub
Generative Adversarial Networks for different impaired speech conversions
☆39Jul 6, 2023Updated 3 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
aispeech-lab / SDNet
View on GitHub
Pytorch implemention of SDNet
☆23Jun 1, 2021Updated 5 years ago
AndreevP / speech_distances
View on GitHub
Deep Speech Distances PyTorch
☆29Feb 21, 2022Updated 4 years ago
wngh1187 / Diff-SV
View on GitHub
Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…
☆23Dec 14, 2023Updated 2 years ago
junyuchen-cjy / DTTNet-Pytorch
View on GitHub
An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation
☆109Mar 19, 2024Updated 2 years ago
glory20h / VoiceLDM
View on GitHub
VoiceLDM: Text-to-Speech with Environmental Context
☆194Aug 9, 2024Updated last year
xinghua-qu / AudioQR
View on GitHub
☆11Jul 14, 2023Updated 3 years ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Adibian / ResGrad
View on GitHub
Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
☆20Feb 9, 2025Updated last year
VoiceBank-NTPU-TW / VoiceBank-2023
View on GitHub
VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.
☆40Jan 4, 2026Updated 6 months ago
KoMyeongJin / SpecDiff-GAN
View on GitHub
Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS
☆40Aug 4, 2023Updated 2 years ago
zawnpn / FFT-GPU-Accel
View on GitHub
Fast Fourier Transform Acceleration Algorithm. (Accelerated by CUDA)
☆12Jul 8, 2018Updated 8 years ago
zyy-fc / CGMM-MVDR
View on GitHub
☆10Aug 3, 2020Updated 5 years ago
tomer-ros / mosnet-speech-enhancement
View on GitHub
Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement
☆25Apr 16, 2023Updated 3 years ago
ljuvela / SourceFilterNeuralFormants
View on GitHub
☆21Sep 20, 2024Updated last year