solyarisoftware / CoquiSTTJsLinks
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆30Updated 4 years ago
Alternatives and similar repositories for CoquiSTTJs
Users that are interested in CoquiSTTJs are comparing it to the libraries listed below
Sorting:
- Web app for keyword spotting using TensorflowJS☆74Updated 3 years ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆128Updated 6 months ago
- Evaluate results from ASR/Speech-to-Text quickly☆41Updated 4 years ago
- On-device voice activity detection (VAD) powered by deep learning☆242Updated 2 weeks ago
- SoTA open-source TTS☆135Updated 8 months ago
- 🐸STT integration examples☆130Updated 3 years ago
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.☆57Updated 4 years ago
- Faster Tortoise inference then Tortoise Fast Fork☆127Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Updated 4 years ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆219Updated 9 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- zero-shot realtime TTS system, fully offline, free and open source☆50Updated 9 months ago
- An automatic speech recognition API☆79Updated last week
- ☆44Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆74Updated 6 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆258Updated last year
- ☆476Updated this week
- ☆56Updated 3 weeks ago
- [WIP] VoiceSmith makes training text to speech models easy.☆228Updated 3 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆135Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆260Updated 2 months ago
- An even smaller speech recognizer / force aligner☆37Updated last year
- Speaker Diarization with Transformers☆70Updated 8 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆48Updated 4 months ago
- ☆80Updated last week
- Your one-stop solution for voice dataset creation☆128Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆106Updated 7 months ago