coqui-ai / STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
★ 2,523 · Updated last year
Alternatives and similar repositories for STT
Users who are interested in STT are comparing it to the libraries listed below.
- Examples of how to use or integrate DeepSpeech ★ 857 · Updated 2 years ago
- A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ★ 1,364 · Updated last year
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) ★ 10,033 · Updated last year
- A python package to analyze and compare voices with deep learning ★ 3,132 · Updated 2 years ago
- Open Text to Speech Server ★ 1,109 · Updated last year
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning. ★ 839 · Updated 2 years ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa… ★ 3,983 · Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice … ★ 511 · Updated 2 years ago
- A fast local neural text to speech engine for Mycroft ★ 1,224 · Updated 7 months ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw… ★ 994 · Updated 4 months ago
- Python interface to the WebRTC Voice Activity Detector ★ 2,378 · Updated last year
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries ★ 1,186 · Updated 3 months ago
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents. ★ 5,720 · Updated 2 weeks ago
- An opensource text-to-speech (TTS) voice building tool ★ 680 · Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector ★ 7,188 · Updated this week
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito. ★ 586 · Updated 4 years ago
- VOSK Speech Recognition Toolkit ★ 480 · Updated 3 years ago
- A small fast portable speech synthesis system ★ 988 · Updated last year
- A python package to build AI-powered real-time audio applications ★ 1,489 · Updated 8 months ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech E… ★ 1,830 · Updated 2 years ago
- On-device streaming speech-to-text engine powered by deep learning ★ 641 · Updated last week
- Simple text to phones converter for multiple languages ★ 1,470 · Updated last year
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial) ★ 2,985 · Updated 2 years ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker… ★ 8,556 · Updated this week
- WaveRNN Vocoder + TTS ★ 2,171 · Updated 3 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone ★ 1,028 · Updated 11 months ago
- 🐸 collection of TTS papers ★ 717 · Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ★ 2,242 · Updated last year
- A comprehensive list of open-source datasets for voice and sound computing (95+ datasets). ★ 2,050 · Updated last year
- A lightweight, simple-to-use, RNN wake word listener ★ 939 · Updated last year