solyarisoftware / CoquiSTTJsLinks
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆30Updated 4 years ago
Alternatives and similar repositories for CoquiSTTJs
Users that are interested in CoquiSTTJs are comparing it to the libraries listed below
Sorting:
- SEPIA server to support open-source speech recognition via WebSocket connection.☆135Updated last year
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.☆56Updated 4 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆261Updated 2 months ago
- Web app for keyword spotting using TensorflowJS☆74Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆153Updated last year
- 🐸STT integration examples☆130Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆331Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆241Updated last week
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆257Updated last year
- Uses ctypes and libespeak-ng to transform test into IPA phonemes☆25Updated 2 years ago
- Speaker Diarization with Transformers☆69Updated 7 months ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆375Updated last year
- Community framework for training tortoise☆44Updated 3 years ago
- ☆45Updated last year
- An automatic speech recognition API☆78Updated last month
- ☆443Updated 2 months ago
- [WIP] VoiceSmith makes training text to speech models easy.☆228Updated 3 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Performant and accurate speech recognition built on Pytorch☆254Updated 3 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Updated 3 years ago
- SoTA open-source TTS☆131Updated 7 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆72Updated 6 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆46Updated 2 years ago
- Voice models for Mimic 3 text to speech system☆160Updated last year
- Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesi…☆41Updated 4 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 months ago
- Evaluate results from ASR/Speech-to-Text quickly☆40Updated 4 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated 2 years ago