Picovoice / cheetah
On-device streaming speech-to-text engine powered by deep learning
☆632 · Updated last week
Alternatives and similar repositories for cheetah
Users who are interested in cheetah are comparing it to the libraries listed below.
- On-device Speech-to-Intent engine powered by deep learning ☆669 · Updated last week
- On-device speech-to-text engine powered by deep learning ☆457 · Updated last week
- Open tools and data for cloudless automatic speech recognition ☆446 · Updated 4 years ago
- Examples of how to use or integrate DeepSpeech ☆852 · Updated last year
- A lightweight, simple-to-use, RNN wake word listener ☆906 · Updated last year
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito. ☆584 · Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆219 · Updated last week
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆208 · Updated 11 months ago
- On-device voice assistant platform powered by deep learning ☆650 · Updated 2 months ago
- Speech-to-text benchmark framework ☆653 · Updated 4 months ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice … ☆509 · Updated 2 years ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw… ☆983 · Updated 2 weeks ago
- OneShot Learning-based hotword detection. ☆265 · Updated 9 months ago
- Wake word engine benchmark framework ☆137 · Updated 3 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,336 · Updated last year
- On-device wake word detection powered by deep learning ☆4,202 · Updated last week
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time ☆342 · Updated last year
- Dockerfile for kaldi-gstreamer-server. ☆289 · Updated 3 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ☆542 · Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆102 · Updated 5 years ago
- g2p: English Grapheme To Phoneme Conversion ☆861 · Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ ) ☆537 · Updated 3 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities ☆209 · Updated 3 years ago
- 🐸STT integration examples ☆129 · Updated 2 years ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy. ☆2,459 · Updated last year
- Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks ☆446 · Updated 5 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t… ☆861 · Updated last year
- Large, modern dataset for speech recognition ☆678 · Updated last year
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc… ☆370 · Updated last week
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets). ☆1,951 · Updated last year