Picovoice / speech-to-text-benchmark
speech to text benchmark framework
☆625 Updated last month
Alternatives and similar repositories for speech-to-text-benchmark:
Users who are interested in speech-to-text-benchmark are comparing it to the libraries listed below:
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit ☆943 Updated 4 months ago
- SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/ ☆882 Updated last month
- wake word engine benchmark framework ☆131 Updated 3 years ago
- Dockerfile for kaldi-gstreamer-server. ☆289 Updated 2 years ago
- On-device streaming speech-to-text engine powered by deep learning ☆602 Updated this week
- Open tools and data for cloudless automatic speech recognition ☆446 Updated 3 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/) ☆535 Updated 2 years ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework. ☆1,077 Updated 7 months ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages ☆468 Updated 4 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su… ☆1,566 Updated 3 months ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,298 Updated 7 months ago
- Web application to record speech for an open data set ☆421 Updated 4 years ago
- A method to generate speech across multiple speakers ☆872 Updated 5 years ago
- Open source audio annotation tool for humans ☆1,071 Updated 4 months ago
- An On-Premises, Streaming Speech Recognition System ☆683 Updated 3 years ago
- On-device Speech-to-Intent engine powered by deep learning ☆634 Updated this week
- Working online speech recognition based on RNN Transducer. (Trained model release available in release) ☆291 Updated 3 years ago
- End-2-end speech synthesis with recurrent neural networks ☆225 Updated 10 months ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible. ☆170 Updated 3 years ago
- Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model. ☆535 Updated 2 weeks ago
- A Python wrapper for Kaldi ☆1,006 Updated 5 months ago
- TensorFlow Implementation of Deep Voice 3 ☆453 Updated 6 years ago
- Offline transcription system for Estonian using Kaldi ☆227 Updated 2 years ago
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models ☆1,972 Updated last year
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion. ☆129 Updated 3 years ago
- Efficient neural speech synthesis ☆1,147 Updated 3 months ago
- CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Has been designed for large scale gender … ☆774 Updated last week
- GStreamer plugin around Kaldi's online neural network decoder ☆185 Updated 4 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities ☆202 Updated 3 years ago
- g2p: English Grapheme To Phoneme Conversion ☆831 Updated 2 years ago