Picovoice / speech-to-text-benchmark
speech to text benchmark framework
☆630Updated last week
Alternatives and similar repositories for speech-to-text-benchmark:
Users that are interested in speech-to-text-benchmark are comparing it to the libraries listed below
- SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/☆883Updated 2 months ago
- Open tools and data for cloudless automatic speech recognition☆447Updated 3 years ago
- On-device streaming speech-to-text engine powered by deep learning☆609Updated this week
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆943Updated 5 months ago
- A method to generate speech across multiple speakers☆872Updated 5 years ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.☆1,076Updated 8 months ago
- Dockerfile for kaldi-gstreamer-server.☆289Updated 2 years ago
- Open source audio annotation tool for humans☆1,081Updated last week
- Web application to record speech for an open data set☆421Updated 4 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆535Updated 3 years ago
- RNN-based generative models for speech.☆611Updated 7 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆291Updated 3 years ago
- wake word engine benchmark framework☆132Updated 3 years ago
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow☆2,842Updated last year
- Making a TTS model with 1 minute of speech samples within 10 minutes☆184Updated 6 years ago
- An On-Premises, Streaming Speech Recognition System☆683Updated 3 years ago
- On-device Speech-to-Intent engine powered by deep learning☆640Updated this week
- On-device speech-to-text engine powered by deep learning☆448Updated this week
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆339Updated last year
- Python interface to the WebRTC Voice Activity Detector☆2,145Updated 7 months ago
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,406Updated 2 months ago
- Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.☆537Updated last week
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,570Updated 4 months ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆467Updated 4 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆434Updated 4 years ago
- FastCGI support for Kaldi ASR☆185Updated 5 years ago
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verification☆782Updated 4 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆216Updated 4 years ago
- The official repository of the Eesen project☆826Updated 5 years ago