ttslr / i-ETTS
[InterSpeech'2021] Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability
☆8Updated 9 months ago
Alternatives and similar repositories for i-ETTS
Users who are interested in i-ETTS are comparing it to the repositories listed below.
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆25Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆21Updated last year
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 3 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- TTS Text Analyzer☆32Updated last year
- ICASSP2022 TTS&VC Summary☆14Updated 3 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Updated 2 years ago
- Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis…☆14Updated 4 months ago
- Spherical residual vector quantization (SRVQ)☆30Updated 10 months ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated 2 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated last year
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆21Updated 10 months ago
- Reimplementation of Miipher☆22Updated last year
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Updated last week
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Updated 4 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆20Updated 3 years ago
- A neural speech codec based on discrete WavLM representations☆24Updated 10 months ago
- Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat…☆33Updated last year
- MFA acoustic model training based on Opencpop☆15Updated 2 years ago