solyarisoftware / CoquiSTTJsLinks
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆30Updated 4 years ago
Alternatives and similar repositories for CoquiSTTJs
Users that are interested in CoquiSTTJs are comparing it to the libraries listed below
Sorting:
- Web app for keyword spotting using TensorflowJS☆74Updated 3 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆135Updated last year
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.☆56Updated 4 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 months ago
- 🐸STT integration examples☆130Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning☆242Updated 2 weeks ago
- An automatic speech recognition API☆79Updated last week
- Faster Tortoise inference then Tortoise Fast Fork☆127Updated last year
- Speaker Diarization with Transformers☆70Updated 8 months ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Community framework for training tortoise☆44Updated 3 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆74Updated 6 months ago
- [WIP] VoiceSmith makes training text to speech models easy.☆228Updated 3 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆260Updated 2 months ago
- ☆44Updated last year
- An even smaller speech recognizer / force aligner☆37Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆46Updated 2 years ago
- Evaluate results from ASR/Speech-to-Text quickly☆41Updated 4 years ago
- Speaker diarization model☆32Updated 2 years ago
- Performant and accurate speech recognition built on Pytorch☆254Updated 3 years ago
- Coqui AI TTS plugin☆85Updated 7 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 10 months ago
- Create an LJSpeech structured voice dataset on wave input☆37Updated last year
- Uses ctypes and libespeak-ng to transform test into IPA phonemes☆26Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated 2 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Updated 4 years ago