coqui-ai / STT-modelsLinks
Open models for Coqui STT
β146Updated 2 years ago
Alternatives and similar repositories for STT-models
Users that are interested in STT-models are comparing it to the libraries listed below
Sorting:
- On-device voice activity detection (VAD) powered by deep learningβ233Updated last month
- πΈSTT integration examplesβ129Updated 3 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.β373Updated last year
- openvino version of openai/whisperβ176Updated 2 years ago
- C++ library for converting text to phonemes for Piperβ134Updated 4 months ago
- Voice models for Mimic 3 text to speech systemβ157Updated last year
- πΈ - A general purpose model trainer, as flexible as it getsβ227Updated last year
- β350Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.β119Updated 2 years ago
- A curated list of awesome voice activity detectionβ68Updated 11 months ago
- Experiments to test different speech recognition systems for SEPIA Frameworkβ63Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β328Updated last year
- Desktop application for neural speech synthesis written in C++β213Updated 2 years ago
- On-device noise suppression powered by deep learningβ76Updated 3 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection.β134Updated last year
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event β¦β406Updated last year
- On-device streaming text-to-speech engine powered by deep learningβ122Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ97Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)β102Updated 2 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ347Updated last year
- An automatic speech recognition APIβ73Updated this week
- ONNX-compatible Fast SeamlessM4TβMassively Multilingual & Multimodal Machine Translationβ43Updated 2 years ago
- ONNX Inference of Pyannote Segmentationβ95Updated 10 months ago
- Batch Support for OpenAI Whisperβ95Updated last year
- NeMo text processing for ASR and TTSβ384Updated last week
- Metadata and versioning details for the Common Voice datasetβ160Updated last month
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisperβ31Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated 2 years ago
- β202Updated 3 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.β67Updated last year