solyarisoftware / CoquiSTTJs
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
★30 Updated 4 years ago
Alternatives and similar repositories for CoquiSTTJs
Users who are interested in CoquiSTTJs are comparing it to the libraries listed below.
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ★137 Updated 2 years ago
- 🐸STT integration examples ★129 Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning ★238 Updated last week
- An automatic speech recognition API ★76 Updated last month
- Speaker diarization model ★32 Updated 2 years ago
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server. ★55 Updated 4 years ago
- Web app for keyword spotting using TensorflowJS ★74 Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ★153 Updated last year
- Evaluate results from ASR/Speech-to-Text quickly ★40 Updated 3 years ago
- ★43 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ★100 Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection. ★134 Updated last year
- 🐸TTS recipes for different datasets ★86 Updated 3 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma… ★21 Updated 3 years ago
- Simple diarization model ★53 Updated 6 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ★329 Updated last year
- SoTA open-source TTS ★120 Updated 6 months ago
- ★44 Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ★46 Updated 2 years ago
- An even smaller speech recognizer / force aligner ★37 Updated last year
- Faster Tortoise inference than Tortoise Fast Fork ★128 Updated last year
- Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ★260 Updated last month
- Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. Advanced audio processing. ★256 Updated last year
- Coqui's machine learning job scheduler ★31 Updated 4 years ago
- Synchronize Whisper's timestamps over an existing accurate transcription ★160 Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ★14 Updated last month
- On-device speaker diarization powered by deep learning ★61 Updated last week
- Coqui AI TTS plugin ★85 Updated 5 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback. ★121 Updated 2 years ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ★46 Updated 3 months ago