solyarisoftware / CoquiSTTJs
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆30Updated 4 years ago
Alternatives and similar repositories for CoquiSTTJs
Users who are interested in CoquiSTTJs are comparing it to the libraries listed below.
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 9 months ago
- 🐸STT integration examples☆130Updated 2 years ago
- Create an LJSpeech structured voice dataset on wave input☆33Updated 10 months ago
- Community framework for training tortoise☆43Updated 2 years ago
- SoTA open-source TTS☆72Updated 2 months ago
- Faster Tortoise inference than Tortoise Fast Fork☆128Updated last year
- Web app for keyword spotting using TensorflowJS☆73Updated 2 years ago
- ☆273Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆254Updated last year
- Performant and accurate speech recognition built on Pytorch☆253Updated 3 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆321Updated 8 months ago
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.☆50Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Coqui AI TTS plugin☆85Updated last month
- Speaker Diarization with Transformers☆69Updated 2 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆248Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- An even smaller speech recognizer / force aligner☆35Updated 7 months ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Voice models for Mimic 3 text to speech system☆150Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- An automatic speech recognition API☆66Updated 3 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆114Updated 2 weeks ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆42Updated last week
- Uses ctypes and libespeak-ng to transform text into IPA phonemes☆21Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆98Updated last month
- On-device voice activity detection (VAD) powered by deep learning☆222Updated 2 weeks ago
- ☆43Updated last year