coqui-ai / STTLinks
πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
β2,534Updated last year
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below
Sorting:
- Examples of how to use or integrate DeepSpeechβ857Updated 2 years ago
- Open Text to Speech Serverβ1,114Updated last year
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,200Updated 3 months ago
- A fast local neural text to speech engine for Mycroftβ1,239Updated 7 months ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β995Updated 5 months ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,364Updated last year
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,054Updated 2 years ago
- A python package to analyze and compare voices with deep learningβ3,153Updated 2 years ago
- Python interface to the WebRTC Voice Activity Detectorβ2,400Updated last year
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Nodeβ13,643Updated 3 weeks ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,037Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ7,385Updated last week
- VOSK Speech Recognition Toolkitβ483Updated 3 years ago
- End-to-End Speech Processing Toolkitβ9,592Updated this week
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,987Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β512Updated 2 years ago
- π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.β1,156Updated last year
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,107Updated 2 years ago
- πΈ collection of TTS papersβ717Updated last year
- π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).β2,076Updated last year
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β841Updated 2 years ago
- An Open Source text-to-speech system built by inverting Whisper.β4,530Updated 5 months ago
- On-device streaming speech-to-text engine powered by deep learningβ641Updated this week
- Command-line tools for speech and intent recognition on Linuxβ1,107Updated last year
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Eβ¦β1,844Updated 2 years ago
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license sβ¦β665Updated 10 months ago
- Silero Models: pre-trained text-to-speech models made embarrassingly simpleβ5,569Updated this week
- A PyTorch-based Speech Toolkitβ10,818Updated this week
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,266Updated last year
- Facebook AI Research's Automatic Speech Recognition Toolkitβ6,442Updated 2 weeks ago