solyarisoftware / CoquiSTTJs
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆29Updated 3 years ago
Alternatives and similar repositories for CoquiSTTJs
Users who are interested in CoquiSTTJs are comparing it to the libraries listed below.
- An even smaller speech recognizer / force aligner☆33Updated 5 months ago
- Create an LJSpeech structured voice dataset on wave input☆30Updated 8 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆127Updated 6 months ago
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- Web app for keyword spotting using TensorflowJS☆71Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 7 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆41Updated last month
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.☆48Updated 3 years ago
- Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesi…☆41Updated 3 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆253Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- On-device speaker diarization powered by deep learning☆46Updated 3 weeks ago
- Building blocks for voice-enabled applications in the browser☆37Updated last month
- ☆43Updated 11 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆91Updated 2 weeks ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 10 months ago
- On-device voice activity detection (VAD) powered by deep learning☆217Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last week
- An automatic speech recognition API☆60Updated this week
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- 🐸STT integration examples☆128Updated 2 years ago
- VoiceBox neural network implementation☆108Updated 10 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆179Updated 8 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆21Updated 9 months ago
- On-device Speech-to-Index engine powered by deep learning☆36Updated last month
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆56Updated last month
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year