synesthesiam / voice2json-profilesLinks
Speech models and artifacts for voice2json
☆14Updated 2 years ago
Alternatives and similar repositories for voice2json-profiles
Users that are interested in voice2json-profiles are comparing it to the libraries listed below
Sorting:
- Docker images for Coqui AI☆61Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Wake word detection engine based on Snips Personal Wakeword Detector☆60Updated 2 years ago
- Coqui AI TTS plugin☆85Updated 5 months ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- 🐍 Coqui's machine learning job scheduler☆31Updated 4 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆46Updated 2 years ago
- streaming speech to text server using Whisper☆98Updated 2 years ago
- Botium Speech Processing☆943Updated 3 weeks ago
- Improved SVOX PicoTTS speech synthesizer☆108Updated 4 years ago
- Speaker diarization model☆32Updated 2 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆56Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- 🐸STT integration examples☆129Updated 3 years ago
- A library for real-time voice processing in web browsers☆236Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- Train your own speech AI model from scratch☆28Updated last week
- Voice models for Mimic 3 text to speech system☆160Updated last year
- Mycroft's multilingual text parsing and formatting library☆78Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆238Updated last week
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- openvino version of openai/whisper☆178Updated 2 years ago
- On-device streaming text-to-speech engine powered by deep learning☆126Updated last week
- Create an LJSpeech structured voice dataset on wave input☆37Updated last year
- Provides Piper neural text-to-speech voices as a browser extension☆71Updated 2 weeks ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆130Updated 4 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆126Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆134Updated last year
- Buildings block for voice-enabled applications in the browser☆37Updated 8 months ago
- Performant and accurate speech recognition built on Pytorch☆254Updated 3 years ago