Picovoice / speech-to-text-benchmarkLinks
speech to text benchmark framework
☆654Updated last week
Alternatives and similar repositories for speech-to-text-benchmark
Users that are interested in speech-to-text-benchmark are comparing it to the libraries listed below
Sorting:
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆941Updated 10 months ago
- SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/☆882Updated 6 months ago
- Open tools and data for cloudless automatic speech recognition☆446Updated 4 years ago
- An On-Premises, Streaming Speech Recognition System☆684Updated 3 years ago
- Open source audio annotation tool for humans☆1,103Updated 5 months ago
- Making a TTS model with 1 minute of speech samples within 10 minutes☆184Updated 7 years ago
- Web application to record speech for an open data set☆424Updated 5 years ago
- Dockerfile for kaldi-gstreamer-server.☆289Updated 3 years ago
- On-device streaming speech-to-text engine powered by deep learning☆634Updated last week
- A method to generate speech across multiple speakers☆872Updated 6 years ago
- Identify a spoken language using artificial intelligence (LID).☆123Updated 7 years ago
- On-device speech-to-text engine powered by deep learning☆457Updated this week
- Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.☆541Updated 3 months ago
- On-device Speech-to-Intent engine powered by deep learning☆670Updated last week
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.☆1,086Updated last year
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆342Updated 2 years ago
- RNN-based generative models for speech.☆610Updated 8 years ago
- wake word engine benchmark framework☆137Updated 3 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆475Updated 5 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,579Updated 9 months ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆293Updated 3 years ago
- This is now the official location of the Merlin project.☆1,315Updated 5 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆129Updated 4 years ago
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model☆1,831Updated 3 years ago
- A webpage and API for using Mozilla DeepSpeech☆47Updated 4 years ago
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)☆2,691Updated last year
- Tensorflow Implementation of Deep Voice 3☆450Updated 7 years ago
- Implementation of Google's Tacotron in TensorFlow☆235Updated 7 years ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆217Updated 5 years ago