painebenjamin / hey-buddy
An end-to-end library for training audio wake-word models and deploying them in the browser.
☆38Updated 6 months ago
Alternatives and similar repositories for hey-buddy
Users that are interested in hey-buddy are comparing it to the libraries listed below
- chatterbox TTS + Voice Clone using onnx☆27Updated 3 weeks ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated last year
- Repo & Project for the Imminent Research Grant code & tasks☆12Updated last year
- Whisper finetuning☆15Updated 9 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 9 months ago
- Russian accentuator and IPA transcriber☆17Updated last year
- Neural text to speech system that uses eSpeak as a text/phoneme front-end☆16Updated 4 years ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 10 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 months ago
- Pybind11 bindings for Kaldi☆15Updated 2 weeks ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20Updated 8 months ago
- ☆20Updated 3 years ago
- Evaluation of STT models for the German language☆15Updated 4 years ago
- Forced alignment decoder for Whisper.☆14Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Updated 7 months ago
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G…☆20Updated 8 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Updated 7 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆50Updated 9 months ago
- ☆11Updated 4 months ago
- ☆17Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Updated 4 months ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Updated last year
- Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).☆15Updated 2 years ago
- All-in-one Speech Transcription☆10Updated this week
- An upgraded framework for training and validation, compared with icefall, using Lightning.☆14Updated 10 months ago
- Free Dutch voice dataset☆12Updated 5 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Updated last year
- ☆16Updated this week
- A Weakly Supervised Forced Alignment for disfluent speech☆15Updated 2 years ago