st-matskevich / local-wake
Wake word detection with custom phrases without model training
☆22Updated 3 months ago
Alternatives and similar repositories for local-wake
Users who are interested in local-wake are comparing it to the repositories listed below
- Evaluation of STT models for the German language☆15Updated 3 years ago
- A simple, but performant framework for mapping speech directly to categories and intents.☆22Updated last year
- Open TTS models, built for streaming on the edge☆44Updated 8 months ago
- ☆51Updated last week
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆18Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆128Updated 3 months ago
- Add n-gram and large language model (LLM) support to Whisper models.☆36Updated 6 months ago
- Speaker diarization service☆24Updated 5 months ago
- ☆19Updated 8 months ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 6 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆27Updated 3 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Updated 8 months ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Updated last year
- High-performance, semantic turn detection for conversational AI☆28Updated 2 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Updated last year
- A deep-learning-powered accessibility application that turns PDFs into audio files. Featuring OCR improvement and TTS with inflection!☆25Updated 9 months ago
- ☆11Updated 3 months ago
- Create an LJSpeech-structured voice dataset from wave input☆36Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 weeks ago
- On-device noise suppression powered by deep learning☆77Updated last week
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30Updated 2 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Updated this week
- StyleTTS 2 Optimized Training Fork☆34Updated 10 months ago
- Text-to-Speech converter for Basque and Spanish. It includes linguistic processing and built voices for the aforementioned languages. Its…☆17Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Updated 8 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆45Updated 2 months ago
- Coqui AI TTS plugin☆87Updated 5 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- An automatic speech recognition API☆76Updated 2 weeks ago