solyarisoftware / CoquiSTTJs
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆25Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for CoquiSTTJs
- Web app for keyword spotting using TensorflowJS☆69Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆24Updated last year
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.☆44Updated 3 years ago
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- An even smaller speech recognizer / force aligner☆32Updated 2 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆44Updated last year
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated 10 months ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- brainless concatenative text to speech☆11Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆98Updated last year
- Coqui AI TTS plugin☆67Updated last month
- SEPIA server to support open-source speech recognition via WebSocket connection.☆120Updated this week
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- On-device speaker diarization powered by deep learning☆25Updated last month
- 🐸STT integration examples☆121Updated 2 years ago
- On-device Speech-to-Index engine powered by deep learning☆34Updated last month
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆139Updated 6 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆133Updated last year
- ☆17Updated last year
- ☆16Updated 3 years ago
- ☆77Updated 5 months ago
- VALL-E 2 reproduction☆83Updated 3 months ago
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆61Updated 11 months ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- Evaluate results from ASR/Speech-to-Text quickly☆36Updated 2 years ago
- An automatic speech recognition API☆42Updated 2 months ago
- On-device voice activity detection (VAD) powered by deep learning☆173Updated 2 weeks ago