coqui-ai / STTLinks
πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
β2,502Updated last year
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below
Sorting:
- Examples of how to use or integrate DeepSpeechβ857Updated 2 years ago
- A fast local neural text to speech engine for Mycroftβ1,212Updated 5 months ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,354Updated last year
- Open Text to Speech Serverβ1,097Updated last year
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β5,476Updated this week
- A python package to build AI-powered real-time audio applicationsβ1,415Updated 6 months ago
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,154Updated last month
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β9,973Updated last year
- A python package to analyze and compare voices with deep learningβ3,069Updated last year
- Python interface to the WebRTC Voice Activity Detectorβ2,336Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ6,685Updated this week
- Open-Source Large Vocabulary Continuous Speech Recognition Engineβ1,899Updated 2 months ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,016Updated 9 months ago
- Simple text to phones converter for multiple languagesβ1,444Updated 11 months ago
- A PyTorch-based Speech Toolkitβ10,336Updated 2 weeks ago
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license sβ¦β645Updated 7 months ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β513Updated 2 years ago
- VOSK Speech Recognition Toolkitβ467Updated 3 years ago
- πΈ collection of TTS papersβ711Updated last year
- On-device streaming speech-to-text engine powered by deep learningβ635Updated last week
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β838Updated last year
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,966Updated last year
- An Open Source text-to-speech system built by inverting Whisper.β4,347Updated 2 months ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speakerβ¦β8,165Updated this week
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simpleβ5,456Updated last year
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.β1,791Updated last month
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β992Updated 2 months ago
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processingβ1,395Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translationβ3,248Updated 2 months ago
- A fast, local neural text to speech systemβ9,919Updated this week