Picovoice / cheetah
On-device streaming speech-to-text engine powered by deep learning
☆619 Updated this week
Alternatives and similar repositories for cheetah:
Users who are interested in cheetah are comparing it to the libraries listed below.
- On-device Speech-to-Intent engine powered by deep learning ☆648 Updated this week
- speech to text benchmark framework ☆639 Updated last month
- A lightweight, simple-to-use, RNN wake word listener ☆887 Updated last year
- On-device speech-to-text engine powered by deep learning ☆448 Updated this week
- Open tools and data for cloudless automatic speech recognition ☆447 Updated 3 years ago
- On-device voice assistant platform powered by deep learning ☆627 Updated last month
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito. ☆582 Updated 3 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆203 Updated 7 months ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,316 Updated 9 months ago
- 🐸STT integration examples ☆126 Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ ) ☆535 Updated 3 years ago
- Examples of how to use or integrate DeepSpeech ☆843 Updated last year
- Dockerfile for kaldi-gstreamer-server. ☆289 Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆202 Updated this week
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit ☆943 Updated 6 months ago
- On-device wake word detection powered by deep learning ☆3,993 Updated this week
- OneShot Learning-based hotword detection. ☆252 Updated 6 months ago
- wake word engine benchmark framework ☆133 Updated 3 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages ☆468 Updated 5 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆102 Updated 4 years ago
- Efficient neural speech synthesis ☆1,160 Updated 6 months ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice … ☆506 Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw… ☆962 Updated this week
- Web application to record speech for an open data set ☆421 Updated 4 years ago
- g2p: English Grapheme To Phoneme Conversion ☆842 Updated 2 years ago
- Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks ☆438 Updated 4 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆165 Updated 9 months ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2) ☆644 Updated 4 years ago
- DeepSpeech based forced alignment tool ☆237 Updated 4 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time ☆340 Updated last year