coqui-ai / STTLinks
πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
β2,523Updated last year
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below
Sorting:
- Examples of how to use or integrate DeepSpeechβ857Updated 2 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,364Updated last year
- A fast local neural text to speech engine for Mycroftβ1,227Updated 7 months ago
- Open Text to Speech Serverβ1,110Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β512Updated 2 years ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β994Updated 4 months ago
- A PyTorch-based Speech Toolkitβ10,683Updated this week
- A python package to analyze and compare voices with deep learningβ3,132Updated 2 years ago
- On-device streaming speech-to-text engine powered by deep learningβ641Updated last week
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,030Updated 11 months ago
- π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.β1,154Updated last year
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β840Updated 2 years ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ7,188Updated last week
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,985Updated last year
- Python interface to the WebRTC Voice Activity Detectorβ2,383Updated last year
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,038Updated last year
- πΈ collection of TTS papersβ717Updated last year
- A python package to build AI-powered real-time audio applicationsβ1,489Updated 8 months ago
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license sβ¦β657Updated 9 months ago
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simpleβ5,524Updated 2 years ago
- On-device wake word detection powered by deep learningβ4,463Updated last week
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Nodeβ13,483Updated last week
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,098Updated 2 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.β1,812Updated 3 months ago
- A lightweight, simple-to-use, RNN wake word listenerβ939Updated last year
- End-to-End Speech Processing Toolkitβ9,537Updated this week
- WaveRNN Vocoder + TTSβ2,173Updated 3 years ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Eβ¦β1,833Updated 2 years ago
- Facebook AI Research's Automatic Speech Recognition Toolkitβ6,442Updated last week
- An opensource text-to-speech (TTS) voice building toolβ680Updated last year