dtreskunov / tiny-kaldi
Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.
☆16Updated 4 years ago
Alternatives and similar repositories for tiny-kaldi:
Users who are interested in tiny-kaldi are comparing it to the libraries listed below:
- Kaldi based speaker verification☆47Updated 7 years ago
- On-device voice activity detection (VAD) powered by deep learning☆206Updated last week
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated last year
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace☆14Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆111Updated 2 years ago
- Various algorithms for voice activity detection☆22Updated 8 years ago
- Performs inverse normalization on normalized Chinese text; specifically, it implements the inverse of chinese_text_normalization.☆13Updated 4 years ago
- A demo of Android keyword spotting based on the TensorFlow tutorial example☆27Updated 4 years ago
- This repository is a collection of TTS Models in TFLite☆192Updated 4 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- A random forest classifier to predict the age-group and gender of a speaker from voice measurements.☆18Updated 5 years ago
- py-webrtcvad wrapper for trimming speech clips☆48Updated 2 years ago
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Updated 4 years ago
- Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and …☆20Updated 2 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated last year
- ☆61Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- Tunable pipelines☆32Updated last month
- An online speech recognition extension toolkit of Kaldi☆56Updated 3 years ago
- ☆17Updated 2 years ago
- Went online decode demo☆29Updated 3 years ago
- Fine-tune the chain model based on the CVTE open-source model without training any GMM for frame alignment☆12Updated 4 years ago
- ☆39Updated last year
- ☆15Updated 3 years ago
- wake word spotting with kaldi☆19Updated 4 years ago