Akella17 / speaker-embedding
A deep neural network for finding text-independent speaker embeddings, written in TensorFlow and Tensorpack
☆10Updated 7 years ago
Alternatives and similar repositories for speaker-embedding
Users who are interested in speaker-embedding are comparing it to the libraries listed below.
- ☆51Updated 6 years ago
 - PyTorch based speaker embedding model☆16Updated last year
 - An evaluation toolkit for voice conversion models.☆42Updated 4 years ago
 - Gaussian Mixture VAE Tacotron☆53Updated 2 years ago
 - Implementation of Multi speaker TTS☆51Updated 4 years ago
 - Implementation of Global Style Token Tacotron in TensorFlow2☆26Updated 5 years ago
 - An unofficial implementation of Vector Quantization Voice Conversion (VQVC).☆29Updated 4 years ago
 - Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in a sequence-to-sequence RNN-LSTM architecture in TensorFlow.☆27Updated 7 years ago
 - Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
 - Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Updated 3 years ago
 - Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated 2 years ago
 - Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
 - ☆40Updated 3 years ago
 - ☆52Updated 4 years ago
 - PyTorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Updated 4 years ago
 - WaveNet auto-encoders for ZeroSpeech challenge 2020☆37Updated 3 years ago
 - PyTorch implementation for Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971)☆39Updated 6 years ago
 - Tacotron2 with Global Style Tokens☆65Updated 6 years ago
 - UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆75Updated 4 years ago
 - Korean Emotional End-to-End Neural Speech synthesizer, ML4audio, NIPS2017☆72Updated 6 years ago
 - RawNet: Fast End-to-End Neural Vocoder☆42Updated 6 years ago
 - ☆24Updated 3 years ago
 - Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
 - Streaming attention networks for end-to-end automatic speech recognition☆55Updated 5 years ago
 - VQVAE for Unsupervised Voice Conversion☆21Updated 6 years ago
 - Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆53Updated 5 years ago
 - Voice conversion (VC) investigation using three variants of VAE☆58Updated 6 years ago
 - Speech (audio) subjective evaluation system☆42Updated 5 years ago
 - Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 3 years ago
 - An unofficial implementation of https://arxiv.org/abs/2005.05106☆47Updated 4 years ago