CSLT-THU / IS2019-VAELinks
Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Updated 2 years ago
Alternatives and similar repositories for IS2019-VAE
Users that are interested in IS2019-VAE are comparing it to the libraries listed below
Sorting:
- A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆20Updated 6 years ago
- ☆34Updated 6 years ago
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- pytorch CTC implementation for ASR. Use eesen's fst decoder framework☆10Updated 5 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Updated 7 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 3 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 4 years ago
- Speech enhancement using mimic loss☆16Updated 6 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Updated 6 years ago
- using world vocoder to extract features and make data for training neural networks☆11Updated 8 years ago
- Overlapped Speech detection in Multi-party Conversations☆22Updated 7 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆36Updated 7 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Updated 7 years ago
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Updated 5 years ago
- ☆20Updated 5 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 3 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Updated 7 years ago
- Based on https://github.com/fatchord/WaveRNN☆24Updated 5 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Updated 4 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 6 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Updated 6 years ago
- Transformer based ASR Engine.☆13Updated 4 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Updated 4 years ago
- ☆25Updated 6 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- RepVgg + HiFiGAN☆34Updated 3 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 6 years ago
- ☆25Updated 3 years ago
- Code for ICASSP 2019 paper☆18Updated 7 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Updated 6 years ago