CODEJIN / Speaker_Embedding_Torch
PyTorch based speaker embedding model
☆15Updated 9 months ago
Alternatives and similar repositories for Speaker_Embedding_Torch:
Users that are interested in Speaker_Embedding_Torch are comparing it to the libraries listed below
- An evaluation toolkit for voice conversion models.☆40Updated 3 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 4 years ago
- ☆24Updated 2 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated last year
- An unofficial implementation of Vector Quantization Voice Conversion (VQVC).☆29Updated 3 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆45Updated 2 years ago
- GAN series for voice conversion on VCC2018 dataset☆16Updated 4 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 3 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated last year
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆21Updated 2 years ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 2 years ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow.☆27Updated 6 years ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15Updated 2 years ago
- ☆16Updated 2 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆35Updated last year
- ☆30Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated last year
- ☆51Updated 5 years ago
- with alignment learning and continuous wavelet transform☆20Updated 2 years ago
- Voice conversion (VC) investigation using three variants of VAE☆57Updated 5 years ago
- A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.☆44Updated last year
- ☆53Updated 4 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020☆28Updated last year
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- Speech (audio) subjective evaluation system☆37Updated 4 years ago