SEPIA-Framework / sepia-stt-server
SEPIA server to support open-source speech recognition via WebSocket connection.
☆123Updated 3 months ago
Alternatives and similar repositories for sepia-stt-server:
Users that are interested in sepia-stt-server are comparing it to the libraries listed below
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆200Updated 2 weeks ago
- 🐸STT integration examples☆125Updated 2 years ago
- Application to communicate with SEPIA via browser, iOS and Android. Works as chat messenger with personal-assistant, ASR and TTS integrat…☆63Updated last year
- Wake word detection engine based on Snips Personal Wakeword Detector☆53Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- An even smaller speech recognizer / force aligner☆32Updated 2 months ago
- Open models for Coqui STT☆129Updated last year
- Web app for keyword spotting using TensorflowJS☆70Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆305Updated 3 months ago
- C++ library for converting text to phonemes for Piper☆108Updated 11 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆145Updated 10 months ago
- Core server of the SEPIA Framework responsible for NLU, conversation, smart-service integration, user-accounts and more.☆94Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- Voice models for Mimic 3 text to speech system☆141Updated 8 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆97Updated 3 weeks ago
- Coqui Inference Engine☆38Updated 3 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Docker images for Coqui AI☆57Updated 3 years ago
- Desktop application for neural speech synthesis written in C++☆213Updated 2 years ago
- Performant and accurate speech recognition built on Pytorch☆252Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- An automatic speech recognition API☆54Updated this week
- On-device speaker diarization powered by deep learning☆37Updated 2 weeks ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆110Updated last year
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech☆331Updated 3 years ago
- STT Service based on Kaldi ASR☆15Updated 6 years ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago