rhasspy / piper-sample-generator
Generate samples using Piper to train wake word models
☆34Updated last year
Alternatives and similar repositories for piper-sample-generator:
Users that are interested in piper-sample-generator are comparing it to the libraries listed below
- Tensorflow-based wake word detection☆12Updated 6 months ago
- A simple, but performant framework for mapping speech directly to categories and intents.☆19Updated 8 months ago
- Detect wake words for ESPHome's voice assistant component on the device☆27Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated last year
- Snowboy reimplementation☆85Updated 2 years ago
- Create an LJSpeech structured voice dataset on wave input☆28Updated 6 months ago
- A TensorFlow based wake word detection training framework using synthetic sample generation suitable for certain microcontrollers.☆519Updated 2 months ago
- ☆55Updated last month
- Keyword Spotting (KWS) API wrapper for TFLite streaming models.☆12Updated 3 years ago
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain only☆21Updated 9 months ago
- Mike/Projects/pysilero-vad.git☆18Updated 2 weeks ago
- Local voice recording for creating Piper datasets☆148Updated last month
- Coqui Inference Engine☆38Updated 3 years ago
- On-device streaming text-to-speech engine powered by deep learning☆76Updated this week
- Evaluation of STT models for german language☆15Updated 3 years ago
- On-device speaker diarization powered by deep learning☆43Updated last month
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆67Updated 2 years ago
- C++ library for converting text to phonemes for Piper☆115Updated last year
- StyleTTS 2 Optimized Training Fork☆27Updated 2 months ago
- ONNX Inference of Pyannote Segmentation☆85Updated 4 months ago
- A curated list of awesome voice activity detection☆48Updated 5 months ago
- An open source voice assistant toolkit for many human languages☆352Updated last year
- ☆26Updated 2 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆20Updated 2 weeks ago
- Voice activity engine benchmark framework☆13Updated 2 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆16Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- ☆59Updated last week
- Target speaker automatic speech recognition (TS-ASR)☆11Updated last year