coqui-ai / STTLinks
πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
β2,550Updated last year
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below
Sorting:
- Examples of how to use or integrate DeepSpeechβ857Updated 2 years ago
- Open Text to Speech Serverβ1,118Updated last year
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,381Updated last year
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,103Updated 2 years ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β509Updated 2 years ago
- A python package to analyze and compare voices with deep learningβ3,203Updated 2 years ago
- A fast local neural text to speech engine for Mycroftβ1,244Updated 9 months ago
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license sβ¦β685Updated last month
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,223Updated 5 months ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,050Updated last year
- Simple text to phones converter for multiple languagesβ1,501Updated last year
- An opensource text-to-speech (TTS) voice building toolβ681Updated last year
- Python interface to the WebRTC Voice Activity Detectorβ2,426Updated last year
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β842Updated 2 years ago
- πΈ collection of TTS papersβ721Updated last year
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β6,044Updated last month
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,133Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β1,003Updated 7 months ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,992Updated last year
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,142Updated 2 years ago
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processingβ1,426Updated last year
- End-to-End Speech Processing Toolkitβ9,695Updated last week
- π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).β2,119Updated last year
- An Open Source text-to-speech system built by inverting Whisper.β4,549Updated last month
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Nodeβ14,081Updated last month
- Silero Models: pre-trained text-to-speech models made embarrassingly simpleβ5,719Updated 3 weeks ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inferenceβ5,300Updated last year
- A fast, local neural text to speech systemβ10,449Updated 4 months ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.β1,835Updated 6 months ago
- A python package to build AI-powered real-time audio applicationsβ1,910Updated 11 months ago