Picovoice / speech-to-text-benchmarkLinks
speech to text benchmark framework
☆651Updated 4 months ago
Alternatives and similar repositories for speech-to-text-benchmark
Users that are interested in speech-to-text-benchmark are comparing it to the libraries listed below
Sorting:
- A method to generate speech across multiple speakers☆873Updated 6 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆941Updated 9 months ago
- SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/☆882Updated 6 months ago
- Open tools and data for cloudless automatic speech recognition☆446Updated 4 years ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.☆1,085Updated last year
- Dockerfile for kaldi-gstreamer-server.☆289Updated 3 years ago
- On-device streaming speech-to-text engine powered by deep learning☆631Updated last week
- Web application to record speech for an open data set☆424Updated 5 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,578Updated 8 months ago
- Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.☆541Updated 3 months ago
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models☆1,981Updated last year
- A Python wrapper for Kaldi☆1,017Updated 4 months ago
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow☆2,840Updated 2 years ago
- Efficient neural speech synthesis☆1,174Updated 9 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆536Updated 3 years ago
- On-device speech-to-text engine powered by deep learning☆457Updated this week
- A testing server for a speech to text service based on coqui.ai☆215Updated 2 years ago
- This is now the official location of the Merlin project.☆1,314Updated 5 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆473Updated 5 years ago
- An On-Premises, Streaming Speech Recognition System☆683Updated 3 years ago
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model☆1,832Updated 3 years ago
- A Speaker Recognition System☆677Updated 5 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆293Updated 3 years ago
- An audio/acoustic activity detection and audio segmentation tool☆781Updated 6 months ago
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,432Updated 6 months ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.☆1,867Updated 2 years ago
- RNN-based generative models for speech.☆610Updated 7 years ago
- A Flow-based Generative Network for Speech Synthesis☆2,333Updated last year
- wake word engine benchmark framework☆137Updated 3 years ago
- A PyTorch Implementation of End-to-End Models for Speech-to-Text☆759Updated last year