painebenjamin/hey-buddy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/painebenjamin/hey-buddy)

painebenjamin / hey-buddy

An end-to-end library for training audio wake-word models and deploying them in the browser.

☆44

Alternatives and similar repositories for hey-buddy

Users that are interested in hey-buddy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

painebenjamin / taproot
View on GitHub
An open source real-time AI inference engine for seamless scaling
☆23Jul 2, 2025Updated last year
Bartelds / ctc-dro
View on GitHub
Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.
☆17May 16, 2025Updated last year
thu-spmi / CTC-TTS
View on GitHub
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20Jun 9, 2026Updated last month
lugan113 / SynTTS-Commands-Official
View on GitHub
SynTTS-Commands is a large-scale, multilingual (English & Chinese) synthetic speech command dataset designed for low-power Keyword Spotti…
☆17Feb 5, 2026Updated 5 months ago
NickZaitsev / ru-normalizr
View on GitHub
ru-normalizr — лучший нормализатор русского текста без LLM. Приводит числа, даты, время, сокращения, римские цифры, символы и латиницу в …
☆17Updated this week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
mush42 / mantoq
View on GitHub
Arabic Grapheme-to-Phoneme (G2P) Conversion
☆16Mar 15, 2025Updated last year
changelinglab / PhoneticXeus
View on GitHub
A universal phone recognizer that can transcribe speech in 70+ languages into IPA
☆21Jun 9, 2026Updated last month
nikhilprasanth / Auris
View on GitHub
Offline audiobook reader for EPUB, PDF, and TXT with local OmniVoice TTS, character voices, and synced highlighting.
☆17May 12, 2026Updated 2 months ago
Topping1 / Supertonic-Voice-Mixer
View on GitHub
Voice mixer and modifier for SuperTonic TTS
☆32Nov 25, 2025Updated 7 months ago
avryhof / speech_recognition
View on GitHub
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Mar 9, 2022Updated 4 years ago
mikex86 / DeepSpeech-Java-Bindings
View on GitHub
Java Bindings for the C++ library DeepSpeech
☆10Jun 4, 2020Updated 6 years ago
42io / tflite_kws
View on GitHub
☆13May 1, 2026Updated 2 months ago
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
idiap / bert-text-diarization-atc
View on GitHub
This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)
☆17Dec 1, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zhaohb / MeloTTS-OV
View on GitHub
Using OpenVINO to speed up MeloTTS inference
☆15Nov 1, 2024Updated last year
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
Scicom-AI-Enterprise-Organization / Multilingual-TTS
View on GitHub
Building actual open source including dataset Multilingual TTS more than 150 languages with Voice Cloning.
☆54Jul 14, 2026Updated last week
talker93 / oneMinTTS
View on GitHub
Launch your speech synthesis within one minute.
☆12May 6, 2024Updated 2 years ago
5Hyeons / StyleTTS2-Vocos
View on GitHub
StyleTTS2 + Vocos as a Decoder
☆13Mar 24, 2025Updated last year
alumae / torch-xvectors-wav
View on GitHub
☆22Jun 30, 2021Updated 5 years ago
DicioTeam / dicio-skill
View on GitHub
Assistance component base for Dicio assistant components
☆13Apr 23, 2026Updated 2 months ago
Open-Speech-EkStep / crowdsource-dataplatform
View on GitHub
This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…
☆17Mar 6, 2023Updated 3 years ago
Open-Speech-EkStep / data-acquisition-pipeline
View on GitHub
☆18Apr 28, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
wangzhaode / mnn-tts
View on GitHub
mnn tts demo.
☆19May 7, 2025Updated last year
XXH333 / WordVoice-5A-Pipeline
View on GitHub
The dataset construction pipeline for WordVoice-5A
☆15Updated this week
Alradyin / wallie-V2
View on GitHub
Open-source AI that watches and hears your screen and reacts live as any personality you design — for faceless content, autonomous AI str…
☆21Jul 7, 2026Updated 2 weeks ago
Prem-kumar27 / Fast-KTSpeechCrawler
View on GitHub
Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler
☆23Mar 21, 2021Updated 5 years ago
shiiiijp / ADFD
View on GitHub
Official Implementation for "Age-Dependent Face Diversification via Latent Space Analysis" (CGI2023)
☆15Jan 7, 2025Updated last year
JacobLinCool / vocal-separation
View on GitHub
This is a demo for SOTA vocal separation models. Upload an audio file and the model will separate the vocals from the background music. …
☆18Jul 25, 2024Updated last year
rendchevi / daisy-tts
View on GitHub
🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
☆14Nov 15, 2025Updated 8 months ago
hecko-yes / tts-dataset-prompts
View on GitHub
Finally, some decent sample sentences
☆24Dec 3, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
st-matskevich / local-wake
View on GitHub
Wake word detection with custom phrases without model training
☆56Mar 8, 2026Updated 4 months ago
mzboito / IWSLT2022_Tamasheq_data
View on GitHub
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18Nov 30, 2022Updated 3 years ago
AFun9 / Omnivoice-onnx
View on GitHub
☆15May 13, 2026Updated 2 months ago
xinshengwang / robpitch
View on GitHub
A pitch detection model trained to be robust against noise and reverberation environments.
☆27Jan 21, 2025Updated last year
EMRAI / emrai-synthetic-diarization-corpus
View on GitHub
☆22Sep 24, 2018Updated 7 years ago
mosave / LVTerminal
View on GitHub
Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)
☆19Feb 29, 2024Updated 2 years ago
dr87 / spin-for-rvc
View on GitHub
☆16Sep 6, 2025Updated 10 months ago