coqui-ai / STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
★2,558, updated last year
Alternatives and similar repositories for STT
Users who are interested in STT are comparing it to the libraries listed below.
- Open Text to Speech Server (★1,119, updated last year)
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (★10,111, updated 2 years ago)
- Examples of how to use or integrate DeepSpeech (★857, updated 2 years ago)
- A list of accessible speech corpora for ASR, TTS, and other Speech Technologies (★1,384, updated last year)
- A fast local neural text to speech engine for Mycroft (★1,246, updated 10 months ago)
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice … (★510, updated 2 years ago)
- A python package to analyze and compare voices with deep learning (★3,213, updated 2 years ago)
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries (★1,228, updated 6 months ago)
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone (★1,054, updated last year)
- WaveRNN Vocoder + TTS (★2,177, updated 3 years ago)
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw… (★1,005, updated 8 months ago)
- Python interface to the WebRTC Voice Activity Detector (★2,443, updated last year)
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech" (★2,146, updated 2 years ago)
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa… (★3,994, updated last year)
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning. (★844, updated 2 years ago)
- Thorsten-Voice: A free to use, offline working, high quality German TTS voice should be available for every project without any license s… (★694, updated last week)
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processing (★1,431, updated last year)
- eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents. (★6,121, updated 3 weeks ago)
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference (★5,304, updated last year)
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial) (★2,989, updated 2 years ago)
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech E… (★1,878, updated 2 years ago)
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito. (★586, updated 4 years ago)
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis (★2,316, updated last year)
- Simple text to phones converter for multiple languages (★1,511, updated last year)
- A small fast portable speech synthesis system (★1,025, updated last year)
- 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech. (★1,159, updated last year)
- On-device streaming speech-to-text engine powered by deep learning (★655, updated last week)
- DeepMind's Tacotron-2 Tensorflow implementation (★2,318, updated 2 years ago)
- A PyTorch-based Speech Toolkit (★11,203, updated this week)
- An Open Source text-to-speech system built by inverting Whisper. (★4,555, updated last month)