coqui-ai / STTLinks
πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
β2,462Updated last year
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below
Sorting:
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β9,893Updated last year
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,949Updated last year
- A fast local neural text to speech engine for Mycroftβ1,196Updated 3 months ago
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simpleβ5,369Updated last year
- πΈ collection of TTS papersβ704Updated last year
- Examples of how to use or integrate DeepSpeechβ852Updated last year
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β5,246Updated 3 weeks ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ5,828Updated 10 months ago
- On-device streaming speech-to-text engine powered by deep learningβ634Updated this week
- Open Text to Speech Serverβ1,071Updated last year
- Command-line tools for speech and intent recognition on Linuxβ1,111Updated last year
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,340Updated last year
- On-device speech-to-text engine powered by deep learningβ457Updated this week
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processingβ1,370Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ6,199Updated 3 weeks ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,052Updated last year
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license sβ¦β625Updated 5 months ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β838Updated last year
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ992Updated 8 months ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.β584Updated 3 years ago
- On-device voice assistant platform powered by deep learningβ650Updated 2 months ago
- An Open Source text-to-speech system built by inverting Whisper.β4,298Updated last month
- A small fast portable speech synthesis systemβ979Updated last year
- A fast, local neural text to speech systemβ9,529Updated last week
- Facebook AI Research's Automatic Speech Recognition Toolkitβ6,433Updated 7 months ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β509Updated 2 years ago
- On-device Speech-to-Intent engine powered by deep learningβ670Updated last week
- A multi-voice TTS system trained with an emphasis on qualityβ14,371Updated 7 months ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β983Updated 3 weeks ago
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ41,195Updated 10 months ago