st-matskevich / local-wake
Wake word detection with custom phrases without model training
☆16Updated 2 months ago
Alternatives and similar repositories for local-wake
Users that are interested in local-wake are comparing it to the libraries listed below
- A simple, but performant framework for mapping speech directly to categories and intents.☆22Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆122Updated last month
- On-device noise suppression powered by deep learning☆75Updated 2 months ago
- Speaker diarization service☆24Updated 4 months ago
- Create an LJSpeech structured voice dataset on wave input☆36Updated last year
- Open TTS models, built for streaming on the edge☆43Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated this week
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Evaluation of STT models for German language☆15Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆27Updated 3 years ago
- Tensorflow-based wake word detection☆16Updated this week
- ☆49Updated last week
- A deep-learning powered accessibility application which turns PDFs into audio files. Featuring OCR improvement and TTS with inflection!☆25Updated 8 months ago
- Add n-gram and large language model (LLM) support to Whisper models.☆32Updated 5 months ago
- ☆19Updated 7 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆127Updated 2 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆69Updated 3 months ago
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆131Updated 11 months ago
- Automated, end-to-end wakeword model maker using the Precise Wakeword Engine☆23Updated 3 years ago
- A curated list of awesome voice activity detection☆67Updated 11 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated 11 months ago
- ☆63Updated 2 weeks ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- Generate samples using Piper to train wake word models☆56Updated last month
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆43Updated last month
- A random walk voice style cloning application for Kokoro text to speech☆152Updated 4 months ago
- An open source voice assistant toolkit for many human languages☆379Updated last year