solyarisoftware / CoquiSTTJs
Coqui STT offline engine API for Node.js developers, with a simple HTTP ASR server.
☆28Updated 3 years ago
Alternatives and similar repositories for CoquiSTTJs
Users who are interested in CoquiSTTJs are comparing it to the libraries listed below.
- Create an LJSpeech structured voice dataset on wave input☆29Updated 7 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- zero-shot realtime TTS system, fully offline, free and open source☆35Updated 3 weeks ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 9 months ago
- Web app for keyword spotting using TensorflowJS☆71Updated 2 years ago
- An even smaller speech recognizer / force aligner☆32Updated 4 months ago
- Coqui AI TTS plugin☆74Updated 2 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last month
- Speaker Diarization with Transformers☆64Updated 11 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated last month
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 7 months ago
- On-device speaker diarization powered by deep learning☆44Updated last week
- Simple Diarization model☆47Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆94Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆126Updated 6 months ago
- Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesi…☆40Updated 3 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆54Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆63Updated last week
- Barkify: an unofficial training implementation of Bark TTS by suno-ai☆129Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated last year
- A curated list of awesome voice activity detection☆50Updated 5 months ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆59Updated 2 years ago
- Heteronym to Phoneme Parser☆18Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆37Updated 5 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- OpenAI Whisper Prompt Examples☆52Updated last year
- ☆36Updated last year