coqui-ai / STTLinks
πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
β2,462Updated last year
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below
Sorting:
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β9,893Updated last year
- Examples of how to use or integrate DeepSpeechβ852Updated last year
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,340Updated last year
- VOSK Speech Recognition Toolkitβ451Updated 2 years ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ6,199Updated 3 weeks ago
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simpleβ5,358Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β983Updated 3 weeks ago
- On-device streaming speech-to-text engine powered by deep learningβ634Updated this week
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ989Updated 8 months ago
- Python interface to the WebRTC Voice Activity Detectorβ2,285Updated last year
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β838Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β509Updated 2 years ago
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Nodeβ12,598Updated 2 months ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,948Updated last year
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style trβ¦β899Updated last year
- Simple text to phones converter for multiple languagesβ1,402Updated 9 months ago
- Open Text to Speech Serverβ1,071Updated last year
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β5,233Updated 3 weeks ago
- A fast local neural text to speech engine for Mycroftβ1,196Updated 3 months ago
- An opensource text-to-speech (TTS) voice building toolβ677Updated 11 months ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.β584Updated 3 years ago
- Noise supression using deep filteringβ3,159Updated 8 months ago
- π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).β1,957Updated last year
- π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.β1,149Updated last year
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,052Updated last year
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processingβ1,370Updated last year
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Eβ¦β1,797Updated 2 years ago
- Efficient neural speech synthesisβ1,177Updated 9 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,900Updated 5 months ago
- Open-Source Large Vocabulary Continuous Speech Recognition Engineβ1,893Updated 2 weeks ago