st-matskevich / local-wakeLinks
Wake word detection with custom phrases without model training
☆21Updated 2 months ago
Alternatives and similar repositories for local-wake
Users that are interested in local-wake are comparing it to the libraries listed below
Sorting:
- A simple, but performant framework for mapping speech directly to categories and intents.☆22Updated last year
- Open TTS models, built for streaming on the edge☆44Updated 7 months ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆25Updated 8 months ago
- Create an LJSpeech structured voice dataset on wave input☆37Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated 3 weeks ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆27Updated 3 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆35Updated 6 months ago
- ☆19Updated 8 months ago
- ☆11Updated 2 months ago
- ☆50Updated this week
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆128Updated 3 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Updated 8 months ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 3 years ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆115Updated last month
- On-device noise suppression powered by deep learning☆76Updated 3 months ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆18Updated 7 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Audio tokenization, in the fastest way possible!☆53Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated last year
- Speaker diarization service☆24Updated 4 months ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Updated 11 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆44Updated 2 months ago
- On-device streaming text-to-speech engine powered by deep learning☆122Updated 2 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 4 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- ☆55Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆71Updated 4 months ago