Picovoice / cheetah
On-device streaming speech-to-text engine powered by deep learning
☆609Updated last week
Alternatives and similar repositories for cheetah:
Users that are interested in cheetah are comparing it to the libraries listed below
- On-device Speech-to-Intent engine powered by deep learning☆641Updated this week
- On-device speech-to-text engine powered by deep learning☆448Updated this week
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆202Updated 6 months ago
- Open tools and data for cloudless automatic speech recognition☆447Updated 3 years ago
- speech to text benchmark framework☆631Updated last week
- On-device voice activity detection (VAD) powered by deep learning☆198Updated this week
- On-device voice assistant platform powered by deep learning☆618Updated last week
- A lightweight, simple-to-use, RNN wake word listener☆875Updated last year
- wake word engine benchmark framework☆132Updated 3 years ago
- Examples of how to use or integrate DeepSpeech☆836Updated last year
- 🐸STT integration examples☆125Updated 2 years ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.☆1,076Updated 8 months ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.☆581Updated 3 years ago
- A testing server for a speech to text service based on coqui.ai☆215Updated 2 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆467Updated 4 years ago
- On-device wake word detection powered by deep learning☆3,933Updated this week
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- OneShot Learning-based hotword detection.☆247Updated 5 months ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆339Updated last year
- Voice Activity Detection based on Deep Learning & TensorFlow☆358Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice …☆505Updated last year
- An audio/acoustic activity detection and audio segmentation tool☆765Updated 2 months ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆943Updated 5 months ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- g2p: English Grapheme To Phoneme Conversion☆836Updated 2 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆291Updated 3 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…☆366Updated 2 months ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆434Updated 4 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆535Updated 3 years ago