ttslr / i-ETTSLinks

[InterSpeech'2021] Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability

☆8

Alternatives and similar repositories for i-ETTS

Users that are interested in i-ETTS are comparing it to the libraries listed below

Sorting:

walker-hyf / FCTalker
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)
☆25Updated last year
BUTSpeechFIT / TS_SUPERB
☆15Updated 3 months ago
interactiveaudiolab / emphases
Crowdsourced and Automatic Speech Prominence Estimation
☆21Updated last year
ttslr / MonTTS
☆13Updated 3 years ago
csalt-research / accented-codebooks-asr
☆18Updated 10 months ago
rishikksh20 / iSTFT-Avocodo-pytorch
Ultrafast GAN based Vocoder for Text to Speech
☆50Updated 3 years ago
IU-SAIGE / pse
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆21Updated 2 years ago
lifeiteng / TTS-TextAnalyzer
TTS Text Analyzer
☆32Updated last year
lmxue / ICASSP2022_TTS_VC_Summary
ICASSP2022 TTS&VC Summary
☆14Updated 3 years ago
thuhcsi / SnakeGAN
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37Updated 2 years ago
cyhuang-tw / robust-vc
☆11Updated 3 years ago
Chengyuann / AutoStyle-TTS
Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis…
☆14Updated 4 months ago
prairie-schooner / wav2vec-vc
☆11Updated 2 years ago
hmohebbi / disentangling_representations
☆12Updated 9 months ago
yluo42 / SRVQ
Spherical residual vector quantization (SRVQ)
☆30Updated 10 months ago
shaojinding / Adversarial-Many-to-Many-VC
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …
☆39Updated 2 years ago
thuhcsi / icassp2021-emotion-tts
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
☆34Updated 2 years ago
meelement / noise_adversarial_tacotron
Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…
☆17Updated 5 years ago
WangHelin1997 / DuTa-VC
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆37Updated last year
NeuroWave-ai / CUCVAE-TTS
☆25Updated 3 years ago
rhoposit / icassp2021
☆15Updated 4 years ago
exercise-book-yq / FreeCodec
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆21Updated 10 months ago
ajaybati / miipher2.0
Reimplementation of Miipher
☆22Updated last year
roholazandie / ryan-tts
☆18Updated 3 years ago
Kevin-naticl / LLaSE
LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement
☆16Updated last week
bastibe / MAPS-Scripts
A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.
☆24Updated 4 years ago
xcmyz / ConvTasNet4BasisMelGAN
This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.
☆20Updated 3 years ago
lucadellalib / discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
☆24Updated 10 months ago
WangHelin1997 / Automatic_Speech_Annotator
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…
☆33Updated last year
AlexandaJerry / SingingVoice-MFA-Training
MFA acoustic model training based on Opencpop
☆15Updated 2 years ago