CSLT-THU / IS2019-VAE
TensorFlow and Kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Updated 2 years ago
Alternatives and similar repositories for IS2019-VAE:
Users interested in IS2019-VAE are comparing it to the repositories listed below:
- Uses the WORLD vocoder to extract features and prepare data for training neural networks☆11Updated 7 years ago
- PyTorch CTC implementation for ASR. Uses Eesen's FST decoder framework☆10Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 4 years ago
- A Kaldi/C++ implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Updated 5 years ago
- Addressing Text-dependent Speaker Verification Using Singing Speech☆9Updated 5 years ago
- An open-source Chinese singing voice synthesis dataset.☆21Updated 5 years ago
- ICASSP 2021 accepted papers on voice conversion (VC)☆18Updated 4 years ago
- In this repository, I try to combine k2 with SpeechBrain to decode accurately and quickly.☆16Updated 2 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆35Updated 7 years ago
- A Python interface to OpenFst (fixes the FstDrawer interface issue for version 1.6)☆17Updated 7 years ago
- Speech enhancement using mimic loss☆16Updated 5 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14Updated 4 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- Overlapped Speech detection in Multi-party Conversations☆21Updated 7 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Updated 6 years ago
- Model Fusion Based Prosody Prediction☆17Updated 7 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations in PyTorch.☆14Updated 3 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- This repository is an extension of the GAN-based speech enhancement model SEGAN, and we present two modifications to make model training mor…☆37Updated 2 years ago
- TTS front-end☆11Updated 6 years ago
- A repository containing code for generating noisy speech data from clean data using deep learning methods☆13Updated 3 years ago