solyarisoftware / CoquiSTTJsLinks
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆30Updated 4 years ago
Alternatives and similar repositories for CoquiSTTJs
Users that are interested in CoquiSTTJs are comparing it to the libraries listed below
Sorting:
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.☆51Updated 4 years ago
- Web app for keyword spotting using TensorflowJS☆74Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆131Updated 11 months ago
- 🐸STT integration examples☆129Updated 3 years ago
- Faster Tortoise inference then Tortoise Fast Fork☆127Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- Community framework for training tortoise☆44Updated 2 years ago
- Create an LJSpeech structured voice dataset on wave input☆36Updated last year
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆252Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆126Updated 3 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- SoTA open-source TTS☆103Updated 4 months ago
- Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesi…☆41Updated 4 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆20Updated 3 years ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆158Updated last year
- An automatic speech recognition API☆71Updated last month
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆69Updated 3 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆151Updated last year
- On-device speaker diarization powered by deep learning☆56Updated 2 months ago
- On-device voice activity detection (VAD) powered by deep learning☆231Updated last month
- [WIP] VoiceSmith makes training text to speech models easy.☆226Updated 3 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆102Updated 4 months ago
- Speaker Diarization with Transformers☆69Updated 4 months ago
- An even smaller speech recognizer / force aligner☆36Updated 10 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆43Updated last month
- Coqui AI TTS plugin☆87Updated 3 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated last year
- zero-shot realtime TTS system, fully offline, free and open source☆47Updated 6 months ago