coqui-ai / STT
🐸 STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
★2,436, updated last year
Alternatives and similar repositories for STT
Users who are interested in STT are comparing it to the libraries listed below.
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (★9,855, updated last year)
- Examples of how to use or integrate DeepSpeech (★851, updated last year)
- A list of accessible speech corpora for ASR, TTS, and other Speech Technologies (★1,326, updated 11 months ago)
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone (★974, updated 6 months ago)
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node (★9,567, updated 3 weeks ago)
- 🐸 collection of TTS papers (★690, updated 10 months ago)
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processing (★1,357, updated last year)
- Open Text to Speech Server (★1,056, updated last year)
- A Python package to analyze and compare voices with deep learning (★2,951, updated last year)
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector (★5,876, updated 2 months ago)
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models (★5,750, updated 9 months ago)
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa… (★3,929, updated 10 months ago)
- A PyTorch-based Speech Toolkit (★9,885, updated this week)
- Whisper realtime streaming for long speech-to-text transcription and translation (★2,918, updated 4 months ago)
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw… (★975, updated this week)
- Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech. (★1,147, updated last year)
- An Open Source text-to-speech system built by inverting Whisper. (★4,257, updated last month)
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries (★1,048, updated this week)
- eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents. (★5,109, updated this week)
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech" (★2,029, updated last year)
- Thorsten-Voice: A free to use, offline working, high quality German TTS voice should be available for every project without any license s… (★615, updated 4 months ago)
- A fast local neural text to speech engine for Mycroft (★1,186, updated 2 months ago)
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production (★40,319, updated 9 months ago)
- End to end text to speech system using gruut and onnx (★830, updated last year)
- Simple text to phones converter for multiple languages (★1,387, updated 8 months ago)
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning. (★836, updated last year)
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice … (★509, updated 2 years ago)
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources. (★1,748, updated 7 months ago)
- Streaming transcriber with whisper (★686, updated 2 years ago)
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech E… (★1,779, updated 2 years ago)