coqui-ai / STTLinks
πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
β2,515Updated last year
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below
Sorting:
- Examples of how to use or integrate DeepSpeechβ857Updated 2 years ago
- A fast local neural text to speech engine for Mycroftβ1,213Updated 5 months ago
- Open Text to Speech Serverβ1,100Updated last year
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,003Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β512Updated 2 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,357Updated last year
- Python interface to the WebRTC Voice Activity Detectorβ2,349Updated last year
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,163Updated last month
- A python package to analyze and compare voices with deep learningβ3,088Updated last year
- On-device streaming speech-to-text engine powered by deep learningβ637Updated this week
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β5,550Updated last week
- An opensource text-to-speech (TTS) voice building toolβ680Updated last year
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simpleβ5,475Updated last year
- Controllable and fast Text-to-Speech for over 7000 languages!β1,637Updated 2 months ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.β584Updated 4 years ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β995Updated 3 months ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,021Updated 10 months ago
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license sβ¦β650Updated 8 months ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β838Updated last year
- An Open Source text-to-speech system built by inverting Whisper.β4,356Updated 3 months ago
- VOSK Speech Recognition Toolkitβ474Updated 3 years ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ5,957Updated last year
- On-device voice assistant platform powered by deep learningβ668Updated 5 months ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,974Updated last year
- π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).β2,010Updated last year
- A python package to build AI-powered real-time audio applicationsβ1,460Updated 7 months ago
- On-device Speech-to-Intent engine powered by deep learningβ680Updated this week
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processingβ1,397Updated last year
- Command-line tools for speech and intent recognition on Linuxβ1,110Updated last year
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,090Updated last year