coqui-ai / STTLinks
πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
β2,514Updated last year
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below
Sorting:
- Examples of how to use or integrate DeepSpeechβ858Updated 2 years ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,982Updated last year
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,358Updated last year
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simpleβ5,497Updated last year
- A fast local neural text to speech engine for Mycroftβ1,222Updated 6 months ago
- A python package to analyze and compare voices with deep learningβ3,108Updated last year
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,016Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inferenceβ5,273Updated last year
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β839Updated last year
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processingβ1,397Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β994Updated 3 months ago
- Open Text to Speech Serverβ1,106Updated last year
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,027Updated 11 months ago
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,179Updated 2 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ7,008Updated last month
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β512Updated 2 years ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ5,981Updated last year
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license sβ¦β654Updated 8 months ago
- A PyTorch-based Speech Toolkitβ10,515Updated last week
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Eβ¦β1,825Updated 2 years ago
- WaveRNN Vocoder + TTSβ2,166Updated 3 years ago
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)β2,984Updated 2 years ago
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β5,625Updated 3 weeks ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,232Updated last year
- A python package to build AI-powered real-time audio applicationsβ1,471Updated 7 months ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,093Updated last year
- π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.β1,153Updated last year
- π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).β2,030Updated last year
- On-device streaming speech-to-text engine powered by deep learningβ639Updated 3 weeks ago
- β1,449Updated last year