CMsmartvoice / Unet-TTS
One-shot TTS with Improved Unseen Speaker and Style Transfer
☆37Mar 2, 2022Updated 3 years ago
Alternatives and similar repositories for Unet-TTS
Users that are interested in Unet-TTS are comparing it to the libraries listed below
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- One Shot Voice Cloning based on Unet-TTS☆245Mar 22, 2022Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- ☆15May 8, 2021Updated 4 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech☆252Feb 9, 2022Updated 4 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Sep 10, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and quickly.☆16Jun 17, 2022Updated 3 years ago
- Python library for audio augmentation☆85Jul 6, 2023Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Oct 15, 2021Updated 4 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- ☆37Mar 26, 2024Updated last year
- ☆55Jan 13, 2023Updated 3 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆89Mar 5, 2022Updated 3 years ago
- ☆37May 8, 2021Updated 4 years ago
- ABX and kaldi experiments on speech corpora made easy☆33Oct 7, 2024Updated last year
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.☆194Jun 8, 2023Updated 2 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- ☆30Jan 22, 2026Updated 3 weeks ago
- MarkMelGen is a Markov Melody Generation program that takes configuration, lyric, and example music files and creates a tune for the sup…☆14Jan 29, 2026Updated 2 weeks ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆197Feb 10, 2022Updated 4 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆17Apr 27, 2023Updated 2 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 2 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Jul 25, 2024Updated last year
- Unofficial Pytorch Implementation of WaveGrad2☆112Aug 18, 2021Updated 4 years ago
- The TensorFlow version of multi-speaker TTS training with feedback constraint☆40Oct 12, 2020Updated 5 years ago
- A system for singing voice synthesis☆79Jan 11, 2023Updated 3 years ago
- ☆28Oct 7, 2025Updated 4 months ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆262Nov 15, 2025Updated 2 months ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Jun 6, 2022Updated 3 years ago
- PPG-Based Voice Conversion☆347Jul 22, 2022Updated 3 years ago