This repository contains the Code for SOTA model on Google Speech Command V2 dataset.
☆16Sep 28, 2023Updated 2 years ago
Alternatives and similar repositories for GoogleSpeechCommandLowFootprint
Users that are interested in GoogleSpeechCommandLowFootprint are comparing it to the libraries listed below
Sorting:
- ☆89May 31, 2023Updated 2 years ago
- PrimeK-Net official code☆26Mar 5, 2025Updated 11 months ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆59Jun 3, 2024Updated last year
- InSales e-commerce platform API bindings☆14Jul 13, 2024Updated last year
- Test Framework for few-shot open set KWS☆41Nov 8, 2024Updated last year
- A time delay estimation method for event-based time-series data. Time delay estimation is also known as the correction of time offsets an…☆15Dec 3, 2025Updated 2 months ago
- Code for reproducing work of ICML 2019 paper: Memory-Optimal Direct Convolutions for Maximizing Classification Accuracy in Embedded Appli…☆12Jun 8, 2019Updated 6 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆32Jul 9, 2024Updated last year
- This repo contains required files for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.☆90Feb 13, 2026Updated 2 weeks ago
- ☆29Dec 20, 2025Updated 2 months ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.☆22Sep 20, 2025Updated 5 months ago
- ☆10Oct 20, 2022Updated 3 years ago
- Frame-agnostic XAI Library for Computer Vision, for understanding why models behave that way.☆11Feb 19, 2023Updated 3 years ago
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- ☆12Apr 26, 2021Updated 4 years ago
- Noise Reduction Preprocessing-based Fully Automatic Diagonal Loading Method for Robust Adaptive Beamforming☆13Feb 24, 2020Updated 6 years ago
- superfast text to speech in any voice☆60Feb 16, 2026Updated 2 weeks ago
- A local, voice-controlled AI assistant with the personality of HAL 9000 from 2001: A Space Odyssey.☆21Aug 16, 2025Updated 6 months ago
- Nanos klib for NVIDIA GPUs☆14Mar 25, 2025Updated 11 months ago
- BC-ResNet for Keyword Spotting☆41Jan 11, 2022Updated 4 years ago
- ☆11Apr 12, 2024Updated last year
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆17Aug 26, 2025Updated 6 months ago
- Tokenizer for Text to Speech (TTS) models☆13Jan 16, 2025Updated last year
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Finetuning LLaMA with DeepSpeed☆10Apr 14, 2023Updated 2 years ago
- 🗣️ Convert between phonetic alphabets☆11Feb 7, 2022Updated 4 years ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- Official implementation of the paper "M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding"☆21Jan 14, 2026Updated last month
- A hackable library for running and fine-tuning modern transformer models on commodity and alternative GPUs, powered by tinygrad.☆28Feb 10, 2026Updated 2 weeks ago
- Package for word stress detection☆11Jan 27, 2023Updated 3 years ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- ☆14Oct 19, 2025Updated 4 months ago
- Fork of RecurrentGPT with modifications☆10Sep 18, 2024Updated last year
- FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge☆46May 6, 2023Updated 2 years ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆54Mar 5, 2025Updated 11 months ago
- Spoken Language assessment☆46Nov 17, 2020Updated 5 years ago
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures☆30Jan 29, 2026Updated last month
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 8 months ago
- golang vad (voice activity detection) library based on webrtc☆12Dec 13, 2021Updated 4 years ago