Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)
☆59Jun 3, 2024Updated last year
Alternatives and similar repositories for PhonMatchNet
Users that are interested in PhonMatchNet are comparing it to the libraries listed below
Sorting:
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆45Jan 24, 2026Updated last month
- Official code for Metric learning for user-defined keyword spotting☆38Feb 21, 2024Updated 2 years ago
- Test Framework for few-shot open set KWS☆41Nov 8, 2024Updated last year
- ☆91Jun 9, 2024Updated last year
- ☆54Jul 16, 2025Updated 7 months ago
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- Recipe for LibriPhrase☆33Sep 2, 2023Updated 2 years ago
- ☆11Nov 7, 2024Updated last year
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.☆36Apr 5, 2024Updated last year
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Nov 14, 2023Updated 2 years ago
- DPDFNet: causal single-channel speech enhancement that boosts DeepFilterNet2 with dual-path RNN blocks for stronger long-range temporal a…☆37Updated this week
- End-To-End SpeechSynthesis system with knowledge distillation☆18Jul 16, 2022Updated 3 years ago
- Pytorch implementation of BiFSMNv2, TNNLS 2023☆35Feb 10, 2023Updated 3 years ago
- An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".☆81Apr 15, 2025Updated 10 months ago
- Keyword spotting and forced alignment in any language☆91Feb 12, 2026Updated 2 weeks ago
- ☆89May 31, 2023Updated 2 years ago
- ☆21Jul 15, 2024Updated last year
- This repository contains the Code for SOTA model on Google Speech Command V2 dataset.☆16Sep 28, 2023Updated 2 years ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆155Aug 9, 2025Updated 6 months ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆115Jun 23, 2025Updated 8 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).☆282May 23, 2022Updated 3 years ago
- ☆19Mar 22, 2024Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆289Jul 26, 2025Updated 7 months ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 9 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆38Oct 28, 2025Updated 4 months ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆17Aug 26, 2025Updated 6 months ago
- text to speech☆10Mar 19, 2024Updated last year
- Real-Time De-noising and De-reverbing with Tiny Recurrent UNet☆54Jun 7, 2023Updated 2 years ago
- Unofficial implementation of wavenext vocoder☆59Aug 28, 2024Updated last year
- ☆40Jul 15, 2025Updated 7 months ago
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆28Sep 16, 2023Updated 2 years ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆471May 19, 2025Updated 9 months ago
- ☆22Jul 30, 2025Updated 7 months ago
- ☆11Mar 22, 2023Updated 2 years ago
- Accompanying repository for the paper "Automatic Music Mixing Using a Generative Model of Effect Embeddings"☆22Jan 18, 2026Updated last month