☆24May 6, 2025Updated 9 months ago
Alternatives and similar repositories for DMSE4TTS
Users that are interested in DMSE4TTS are comparing it to the libraries listed below
Sorting:
- Vocal Remover using Deep Neural Networks☆19Dec 31, 2024Updated last year
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Oct 5, 2022Updated 3 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- ☆26Mar 20, 2024Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆13Jul 22, 2024Updated last year
- Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688☆12Dec 2, 2024Updated last year
- ☆22Oct 17, 2024Updated last year
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆38Feb 17, 2026Updated 2 weeks ago
- ☆14Aug 19, 2024Updated last year
- ☆20Sep 20, 2024Updated last year
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆40Aug 4, 2023Updated 2 years ago
- ☆18Jan 18, 2024Updated 2 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Feb 2, 2026Updated last month
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆19Feb 9, 2025Updated last year
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆25Apr 16, 2023Updated 2 years ago
- English conversation corpus for conversational TTS.☆21Mar 13, 2023Updated 2 years ago
- Objective measures of speech quality SNR☆19Aug 1, 2019Updated 6 years ago
- ☆44Sep 19, 2024Updated last year
- Generative Adversarial Networks for different impaired speech conversions☆39Jul 6, 2023Updated 2 years ago
- ☆50Aug 16, 2023Updated 2 years ago
- Implementations of audio watermarking methods, speech quality metrics and attacks in different domains.☆26Updated this week
- Ultimate Vocal Remover Inference CLI☆109Updated this week
- ☆151Apr 25, 2025Updated 10 months ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆23Nov 25, 2023Updated 2 years ago
- Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"☆88Jun 10, 2024Updated last year
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆103Mar 19, 2024Updated last year
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆213Apr 26, 2024Updated last year
- A toolkit dedicate for speech evaluation.☆23Sep 26, 2024Updated last year
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ☆28Nov 7, 2023Updated 2 years ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆36Jun 20, 2023Updated 2 years ago
- Workflow for forced alignment between languages☆23Jan 13, 2026Updated last month
- A sequence-to-sequence voice conversion toolkit.☆108Jul 5, 2024Updated last year
- Deep Speech Distances PyTorch☆29Feb 21, 2022Updated 4 years ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Dec 14, 2023Updated 2 years ago
- ☆58Jun 28, 2024Updated last year