This repository contains the code for a SOTA model on the Google Speech Commands V2 dataset.
☆16Sep 28, 2023Updated 2 years ago
Alternatives and similar repositories for GoogleSpeechCommandLowFootprint
Users that are interested in GoogleSpeechCommandLowFootprint are comparing it to the libraries listed below
- ☆89May 31, 2023Updated 2 years ago
- PrimeK-Net official code☆27Mar 5, 2025Updated last year
- 🗣️ Convert between phonetic alphabets☆11Feb 7, 2022Updated 4 years ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆59Jun 3, 2024Updated last year
- A time delay estimation method for event-based time-series data. Time delay estimation is also known as the correction of time offsets an…☆15Dec 3, 2025Updated 3 months ago
- Official code for Dense-TSNet☆12Sep 17, 2024Updated last year
- golang vad (voice activity detection) library based on webrtc☆12Dec 13, 2021Updated 4 years ago
- ☆11Oct 20, 2022Updated 3 years ago
- Noise Reduction Preprocessing-based Fully Automatic Diagonal Loading Method for Robust Adaptive Beamforming☆13Feb 24, 2020Updated 6 years ago
- Any-Precision Deep Neural Networks (AAAI 2021)☆62May 2, 2020Updated 5 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆17Aug 26, 2025Updated 6 months ago
- Daily Neural Network Practice Season 3! ( Finishing up Masters)☆10Sep 9, 2019Updated 6 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆32Jul 9, 2024Updated last year
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- ☆14Oct 19, 2025Updated 5 months ago
- ☆14Jun 19, 2019Updated 6 years ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆54Mar 5, 2025Updated last year
- ☆11Apr 12, 2024Updated last year
- This is a platform containing the datasets and federated learning algorithms in IoT environments.☆72Dec 9, 2024Updated last year
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆123Jun 29, 2022Updated 3 years ago
- Spoken Language assessment☆46Nov 17, 2020Updated 5 years ago
- combine ASR, LLM and TTS in local development with python☆17Sep 21, 2024Updated last year
- Mike/Projects/pysilero-vad.git☆24Feb 12, 2026Updated last month
- Test Framework for few-shot open set KWS☆42Nov 8, 2024Updated last year
- ☆16Dec 27, 2023Updated 2 years ago
- Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion☆20Jul 9, 2019Updated 6 years ago
- Implementation of Generalized Cross Correlation with Phase Transform (GCC-PHAT) library in C/C++.☆20Jul 8, 2019Updated 6 years ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆49Sep 27, 2024Updated last year
- Official implementation of the paper "M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding"☆21Jan 14, 2026Updated 2 months ago
- KWS demo based on CTC prefix beam search.☆17Oct 21, 2023Updated 2 years ago
- Rank 7th/1817 in the 2018 iFLYTEK AI Developer Challenge with acc 0.82 for the ten Chinese dialects classification task, this code was p…☆13Nov 19, 2023Updated 2 years ago
- [WIP]Trying to implement "Ultra Low Complexity Deep Learning Based Noise Suppression." arXiv preprint arXiv:2312.08132 (2023).☆26May 29, 2024Updated last year
- FastAudio is a learnable audio frontend designed by team Magnum for the ASVspoof 2021 challenge☆46May 6, 2023Updated 2 years ago
- The Official Repo for Paper: Aligning Clinical Needs and AI Capabilities: A Survey on LLMs for Medical Reasoning☆22Sep 27, 2025Updated 5 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch☆134Dec 29, 2025Updated 2 months ago
- English to IPA with syllable correspondence☆13Aug 23, 2022Updated 3 years ago
- Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi☆16Oct 30, 2023Updated 2 years ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆113Sep 14, 2022Updated 3 years ago
- Code for reproducing work of ICML 2019 paper: Memory-Optimal Direct Convolutions for Maximizing Classification Accuracy in Embedded Appli…☆12Jun 8, 2019Updated 6 years ago