mpuels / docker-py-kaldi-asr-and-model
STT Service based on Kaldi ASR
☆15Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for docker-py-kaldi-asr-and-model
- Long audio alignment using Kaldi☆25Updated 3 years ago
- Text frontend for ESPnet tts recipes☆31Updated 3 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆34Updated 4 years ago
- ☆17Updated last year
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated last year
- Addressing Text-dependent Speaker Verification Using Singing Speech☆9Updated 5 years ago
- ☆19Updated 6 years ago
- ☆22Updated 3 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 4 years ago
- it's ASR decoder and make graph project☆32Updated 2 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated 7 months ago
- Online streaming speaker change detection model in Pytorch☆36Updated last year
- This is now the official location of the Kaldi project.☆13Updated 5 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated last year
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- wake word spotting with kaldi☆19Updated 3 years ago
- ☆33Updated 2 years ago
- Text-to-Speech tutorial at SLTU 2016☆35Updated 8 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 5 years ago
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Updated last month
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆14Updated 4 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆62Updated 4 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Updated 5 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- ☆31Updated 2 months ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆26Updated 7 years ago