diggerdu / Dvorak
A startup keyword spotting implementation
☆9Updated 7 years ago
Alternatives and similar repositories for Dvorak:
Users that are interested in Dvorak are comparing it to the libraries listed below
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated last year
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 2 years ago
- C\CPP implementation of Keyword Spotting, following the LSTM approach, based on Tensorflow☆9Updated 7 years ago
- MESSL wrappers etc for JSALT 2015, including CHiME3☆8Updated 6 years ago
- Extended speech recognition neural network based on Kaldi for reproducible research☆15Updated 9 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 4 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Updated 3 years ago
- Losses and decoders for end-to-end ASR and OCR☆33Updated 4 years ago
- Unsupervised speech activity detection system.☆11Updated 6 years ago
- Tsinghua University SPMI Lab array processing toolkit☆18Updated 8 years ago
- A library of speech gadgets.☆13Updated 2 years ago
- ☆31Updated 6 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Updated 6 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆20Updated 5 years ago
- Library for real-time digital signal processing of microphone array signals. It is based on DSPONE adn WIPP and can perform binarula loca…☆15Updated 7 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Updated 6 years ago
- A perceptual weighting filter loss for DNN training in speech enhancement☆24Updated 2 years ago
- Voice Conversion using Tacotron.☆11Updated 2 years ago
- ☆12Updated 7 years ago
- ☆9Updated 6 years ago
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 2 years ago
- text to speech☆10Updated 10 months ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- Custom decoders for Kaldi☆13Updated 5 years ago
- ☆20Updated 5 years ago