coqui-ai / STTLinks
πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
β2,539Updated last year
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below
Sorting:
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,082Updated 2 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,371Updated last year
- A fast local neural text to speech engine for Mycroftβ1,240Updated 8 months ago
- Examples of how to use or integrate DeepSpeechβ856Updated 2 years ago
- Open Text to Speech Serverβ1,115Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β510Updated 2 years ago
- A python package to analyze and compare voices with deep learningβ3,172Updated 2 years ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,989Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β996Updated 6 months ago
- Silero Models: pre-trained text-to-speech models made embarrassingly simpleβ5,655Updated last week
- An Open Source text-to-speech system built by inverting Whisper.β4,539Updated 6 months ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,038Updated last year
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license sβ¦β674Updated 11 months ago
- Python interface to the WebRTC Voice Activity Detectorβ2,411Updated last year
- An opensource text-to-speech (TTS) voice building toolβ679Updated last year
- A fast, local neural text to speech systemβ10,309Updated 3 months ago
- On-device streaming speech-to-text engine powered by deep learningβ644Updated this week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ7,573Updated this week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,082Updated last year
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β5,904Updated 2 weeks ago
- A PyTorch-based Speech Toolkitβ10,886Updated last week
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β842Updated 2 years ago
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,206Updated 4 months ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,116Updated 2 years ago
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processingβ1,418Updated last year
- π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.β1,156Updated last year
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Eβ¦β1,853Updated 2 years ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speakerβ¦β8,789Updated this week
- A python package to build AI-powered real-time audio applicationsβ1,882Updated 10 months ago
- πΈ collection of TTS papersβ719Updated last year