solyarisoftware / CoquiSTTJs
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆27Updated 3 years ago
Alternatives and similar repositories for CoquiSTTJs:
Users that are interested in CoquiSTTJs are comparing it to the libraries listed below
- An even smaller speech recognizer / force aligner☆32Updated last month
- Coqui AI TTS plugin☆72Updated 4 months ago
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.☆44Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆22Updated 5 months ago
- Create an LJSpeech structured voice dataset on wave input☆23Updated 3 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- Speaker Diarization with Transformers☆61Updated 7 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- A curated list of awesome OpenAI's Whisper☆96Updated last year
- Web app for keyword spotting using TensorflowJS☆69Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆123Updated 2 months ago
- C++ library for converting text to phonemes for Piper☆99Updated 10 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆108Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆144Updated 8 months ago
- A simple voice conversion tool☆17Updated 2 years ago
- ☆255Updated 7 months ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- Open models for Coqui STT☆127Updated last year
- Your one-stop solution for voice dataset creation☆117Updated last year
- Simple Diarization model☆46Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- An automatic speech recognition API☆48Updated this week
- Faster Tortoise inference then Tortoise Fast Fork☆126Updated 8 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆226Updated 7 months ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆53Updated last month
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆168Updated 3 months ago
- generate granular word-level captions in srt format☆57Updated 2 years ago