Picovoice / speech-to-text-benchmark
speech to text benchmark framework
☆619Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for speech-to-text-benchmark
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆942Updated 2 months ago
- Open tools and data for cloudless automatic speech recognition☆443Updated 3 years ago
- SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/☆880Updated 3 years ago
- A method to generate speech across multiple speakers☆872Updated 5 years ago
- wake word engine benchmark framework☆131Updated 2 years ago
- On-device streaming speech-to-text engine powered by deep learning☆594Updated 2 weeks ago
- Dockerfile for kaldi-gstreamer-server.☆288Updated 2 years ago
- Web application to record speech for an open data set☆421Updated 4 years ago
- An On-Premises, Streaming Speech Recognition System☆681Updated 2 years ago
- Identify a spoken language using artificial intelligence (LID).☆123Updated 6 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆532Updated 2 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,560Updated last month
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.☆1,073Updated 5 months ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,283Updated 5 months ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆465Updated 4 years ago
- FastCGI support for Kaldi ASR☆184Updated 5 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- RNN-based generative models for speech.☆611Updated 7 years ago
- Program to benchmark various speech recognition APIs☆79Updated 5 years ago
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model☆1,828Updated 2 years ago
- Open source audio annotation tool for humans☆1,062Updated 2 months ago
- Efficient neural speech synthesis☆1,142Updated 2 months ago
- End-2-end speech synthesis with recurrent neural networks☆225Updated 8 months ago
- Tensorflow Implementation of Deep Voice 3☆453Updated 6 years ago
- Making a TTS model with 1 minute of speech samples within 10 minutes☆184Updated 6 years ago
- A testing server for a speech to text service based on coqui.ai☆215Updated 2 years ago
- Implementation of Google's Tacotron in TensorFlow☆236Updated 6 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- g2p: English Grapheme To Phoneme Conversion☆811Updated last year
- Speaker diarization scripts, based on AaltoASR☆190Updated 5 years ago