PyTorch-based speaker embedding model
☆16 · Apr 13, 2024 · Updated last year
Alternatives and similar repositories for Speaker_Embedding_Torch
Users who are interested in Speaker_Embedding_Torch are comparing it to the repositories listed below
- PyTorch implementation of Retriever: Learning Content-Style Representation ☆12 · Jan 27, 2023 · Updated 3 years ago
- acnn for text-independent speaker recognition ☆10 · Feb 8, 2022 · Updated 4 years ago
- ☆10 · Apr 8, 2024 · Updated last year
- Real-time MelGAN on CPU ☆13 · Dec 3, 2019 · Updated 6 years ago
- PyTorch implementation of "F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder" ☆29 · Nov 6, 2020 · Updated 5 years ago
- Simple tool for speech dataset augmentation for modeling various prosodies. ☆14 · Jan 14, 2021 · Updated 5 years ago
- A deep neural network for finding text-independent speaker embeddings, written in TensorFlow and Tensorpack ☆10 · Feb 19, 2018 · Updated 8 years ago
- Code & demo for the animation of still facial landmarks from an initial pose. ☆15 · Jan 19, 2023 · Updated 3 years ago
- An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal. ☆54 · Sep 14, 2022 · Updated 3 years ago
- ☆30 · Jun 30, 2020 · Updated 5 years ago
- Follows NVIDIA's version, simplifies it, and supports data parallelism. ☆13 · Sep 26, 2019 · Updated 6 years ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss". ☆34 · Apr 26, 2021 · Updated 4 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 · Dec 10, 2020 · Updated 5 years ago
- ☆37 · May 8, 2021 · Updated 4 years ago
- ☆15 · May 8, 2021 · Updated 4 years ago
- TPSE-GST Tacotron2 ☆14 · May 1, 2019 · Updated 6 years ago
- GAN series for voice conversion on VCC2018 dataset ☆17 · Aug 27, 2020 · Updated 5 years ago
- Code associated with the paper: Neural Representations for Modeling Variation in Speech. ☆17 · Mar 10, 2022 · Updated 3 years ago
- PyTorch implementation of "Robust and fine-grained prosody control of end-to-end speech synthesis" ☆41 · Feb 20, 2022 · Updated 4 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks ☆20 · Feb 6, 2021 · Updated 5 years ago
- ☆17 · Aug 27, 2025 · Updated 6 months ago
- TensorFlow implementation of Chinese word segmentation using LSTM+CRF and dilated CNN+CRF ☆15 · Jul 16, 2018 · Updated 7 years ago
- WaveNet implementation using tf.estimator ☆21 · Jul 6, 2023 · Updated 2 years ago
- A Neural Audio Codec (NAC) for Universal Audio ☆44 · May 30, 2025 · Updated 9 months ago
- A lightweight neural speaker embedding extractor based on Kaldi and PyTorch. ☆136 · Jan 27, 2020 · Updated 6 years ago
- TTS for pitch-accented language. Korean dialect DB. ☆157 · May 12, 2023 · Updated 2 years ago
- ☆25 · Apr 24, 2019 · Updated 6 years ago
- ☆64 · May 23, 2022 · Updated 3 years ago
- Lightweight speaker anonymization [IEEE SLT2021] ☆27 · Jun 6, 2022 · Updated 3 years ago
- ☆23 · Jul 4, 2020 · Updated 5 years ago
- Flask+Tornado-based NVIDIA Tacotron2+WaveGlow TTS web app ☆28 · May 25, 2023 · Updated 2 years ago
- PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi) ☆27 · Mar 20, 2021 · Updated 4 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No… ☆115 · Dec 7, 2020 · Updated 5 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2 ☆26 · Sep 28, 2020 · Updated 5 years ago
- WaveNet vocoder using TensorFlow ☆26 · Feb 18, 2018 · Updated 8 years ago
- Voice conversion system ☆25 · Jun 10, 2020 · Updated 5 years ago
- A PyTorch implementation of MelGAN ☆66 · Oct 22, 2019 · Updated 6 years ago
- Tacotron2 with Global Style Tokens ☆65 · Apr 19, 2019 · Updated 6 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention ☆203 · Nov 30, 2020 · Updated 5 years ago