dtreskunov / tiny-kaldi
Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.
☆16Updated 4 years ago
Alternatives and similar repositories for tiny-kaldi:
Users who are interested in tiny-kaldi are comparing it to the libraries listed below:
- Kaldi based speaker verification☆47Updated 7 years ago
- Performs inverse normalization on normalized Chinese text; that is, it implements the reverse of chinese_text_normalization.☆13Updated 3 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated last year
- A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)☆16Updated 7 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- A demo of Android keyword spotting based on the TensorFlow tutorial example☆27Updated 4 years ago
- C++ version of the pyannote audio speaker diarization pipeline☆19Updated last year
- Goodness of Pronunciation algorithm using PyKaldi☆15Updated 2 years ago
- STT Service based on Kaldi ASR☆15Updated 6 years ago
- A handy dataset of noises for ASR☆20Updated 5 years ago
- How to create your own model for vosk☆70Updated 3 years ago
- Online streaming speaker change detection model in Pytorch☆38Updated last year
- Wake word spotting with Kaldi☆19Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆110Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Python Wrapper of Silero VAD☆48Updated 3 months ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi☆63Updated 3 years ago
- A library for adding punctuation into a text from ASR.☆16Updated last year
- Speaker Diarization library in Python. Performs VAD, Segmentation, Linear Clustering, Hierarchical Clustering☆15Updated 7 years ago
- Colab notebooks for Next-gen Kaldi☆26Updated last month
- Pre-trained grapheme-to-phoneme (G2P) models☆25Updated 3 years ago
- Tacotron text-to-speech in C++ (synthesis only)☆76Updated 5 years ago
- Perform forced decoding with a target transcription☆11Updated 6 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- WeNet online decoding demo☆29Updated 3 years ago