SEPIA-Framework / sepia-stt-server
SEPIA server to support open-source speech recognition via WebSocket connection.
☆124Updated 5 months ago
Alternatives and similar repositories for sepia-stt-server:
Users that are interested in sepia-stt-server are comparing it to the libraries listed below
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆204Updated this week
- An even smaller speech recognizer / force aligner☆32Updated 3 months ago
- 🐸STT integration examples☆127Updated 2 years ago
- Application to communicate with SEPIA via browser, iOS and Android. Works as chat messenger with personal-assistant, ASR and TTS integrat…☆63Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.☆309Updated 4 months ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆247Updated last year
- Open models for Coqui STT☆135Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆111Updated last year
- Zero-shot Audio Classification using Whisper☆80Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Coqui AI TTS plugin☆74Updated 3 weeks ago
- Docker images for Coqui AI☆57Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 6 months ago
- Desktop application for neural speech synthesis written in C++☆214Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 4 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆147Updated 11 months ago
- Voice models for Mimic 3 text to speech system☆143Updated 9 months ago
- Web app for keyword spotting using TensorflowJS☆71Updated 2 years ago
- On-device noise suppression powered by deep learning☆69Updated 3 weeks ago
- Metadata and versioning details for the Common Voice dataset☆146Updated 3 weeks ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆111Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆99Updated 2 months ago
- 🐸TTS recipes for different datasets☆86Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- ☆39Updated 3 weeks ago