coqui-ai / STT-models
Open models for Coqui STT
☆148 · Updated 2 years ago
Alternatives and similar repositories for STT-models
Users who are interested in STT-models are comparing it to the libraries listed below.
- On-device voice activity detection (VAD) powered by deep learning ☆238 · Updated last week
- C++ library for converting text to phonemes for Piper ☆137 · Updated 5 months ago
- 🐸STT integration examples ☆129 · Updated 3 years ago
- 🐸 - A general purpose model trainer, as flexible as it gets ☆230 · Updated last year
- OpenVINO version of openai/whisper ☆178 · Updated 2 years ago
- Live speech recognition using Facebook's wav2vec 2.0 model. ☆374 · Updated last year
- ☆355 · Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆100 · Updated last year
- On-device noise suppression powered by deep learning ☆78 · Updated last week
- Voice models for Mimic 3 text to speech system ☆160 · Updated last year
- Real-Time Whisper Voice Recognition with Vosk model feedback. ☆121 · Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆329 · Updated last year
- A curated list of awesome voice activity detection ☆70 · Updated last year
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆103 · Updated 4 months ago
- Desktop application for neural speech synthesis written in C++ ☆213 · Updated 2 years ago
- Tunable pipelines ☆40 · Updated 3 months ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ☆260 · Updated last month
- Experiments to test different speech recognition systems for SEPIA Framework ☆62 · Updated 2 years ago
- An automatic speech recognition API ☆76 · Updated last month
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 · Updated 2 years ago
- ☆405 · Updated 2 months ago
- Self-hosted, high-quality voice recognition for de-googled Android using Whisper, like Siri or OK Google. ☆67 · Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆134 · Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆347 · Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆153 · Updated last year
- ONNX-compatible Fast SeamlessM4T: Massively Multilingual & Multimodal Machine Translation ☆43 · Updated 2 years ago
- On-device streaming text-to-speech engine powered by deep learning ☆122 · Updated last week
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆245 · Updated 4 months ago
- ONNX Inference of Pyannote Segmentation ☆97 · Updated last year
- Speaker diarization model ☆32 · Updated 2 years ago