solyarisoftware / CoquiSTTJs
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
β27Updated 3 years ago
Alternatives and similar repositories for CoquiSTTJs:
Users that are interested in CoquiSTTJs are comparing it to the libraries listed below
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- Speaker Diarization with Transformersβ64Updated 11 months ago
- An even smaller speech recognizer / force alignerβ32Updated 4 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ25Updated 2 years ago
- πΈSTT integration examplesβ127Updated 2 years ago
- zero-shot realtime TTS system, fully offline, free and open sourceβ34Updated last week
- Create an LJSpeech structured voice dataset on wave inputβ28Updated 6 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ94Updated 11 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, reβ¦β47Updated last year
- πΌ Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decompositionβ16Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β62Updated 2 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β95Updated 6 months ago
- On-device speaker diarization powered by deep learningβ43Updated last month
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a lβ¦β23Updated 8 months ago
- Evaluate results from ASR/Speech-to-Text quicklyβ37Updated 3 years ago
- Web app for keyword spotting using TensorflowJSβ71Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learningβ206Updated last week
- Misc. tools/scripts that I made to use for tortoiseβ21Updated 8 months ago
- Google's SoundStorm: Efficient Parallel Audio Generationβ131Updated last year
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voiceβ10Updated 4 years ago
- πΈTTS recipes for different datasetsβ86Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β135Updated last year
- Create training data for training a voice cloner for bark text to speech.β44Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ112Updated 2 years ago
- Coqui AI TTS pluginβ74Updated last month
- Real-Time Whisper Voice Recognition with vosk model feedback.β112Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your languageβ26Updated last week
- A TTS model capable of generating ultra-realistic dialogue in one pass.β75Updated this week
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.β48Updated last week
- π« check your data, before you wreck your modelβ16Updated 2 years ago