SEPIA-Framework / sepia-stt-serverLinks
SEPIA server to support open-source speech recognition via WebSocket connection.
β131Updated 11 months ago
Alternatives and similar repositories for sepia-stt-server
Users that are interested in sepia-stt-server are comparing it to the libraries listed below
Sorting:
- πΈSTT integration examplesβ129Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β326Updated 11 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, reβ¦β47Updated 2 years ago
- Open models for Coqui STTβ146Updated 2 years ago
- An even smaller speech recognizer / force alignerβ36Updated 10 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.β119Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learningβ232Updated last month
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ103Updated 5 years ago
- Coqui Inference Engineβ41Updated 4 years ago
- Zero-shot Audio Classification using Whisperβ78Updated 2 years ago
- Desktop application for neural speech synthesis written in C++β213Updated 2 years ago
- Web app for keyword spotting using TensorflowJSβ74Updated 2 years ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ259Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ27Updated 3 years ago
- Voice models for Mimic 3 text to speech systemβ154Updated last year
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β213Updated last year
- A very basic demonstration connecting speech recognition and text-to-speechβ20Updated 5 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ25Updated 2 years ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- An automatic speech recognition APIβ71Updated last month
- Coqui AI TTS pluginβ87Updated 3 months ago
- openvino version of openai/whisperβ176Updated last year
- Gecko - A Tool for Effective Annotation of Human Conversationsβ298Updated 2 years ago
- A simple, but performant framework for mapping speech directly to categories and intents.β22Updated last year
- TTS Client for Coqui TTS serverβ13Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ151Updated last year
- Docker images for Coqui AIβ60Updated 4 years ago
- A curated list of awesome voice activity detectionβ67Updated 11 months ago
- How to create your own model for voskβ75Updated 4 years ago