An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs
☆50Mar 14, 2024Updated 2 years ago
Alternatives and similar repositories for Speech-to-Intent-Micro
Users that are interested in Speech-to-Intent-Micro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆17Aug 26, 2025Updated 7 months ago
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- Making Espnet easier to use☆54Apr 9, 2021Updated 5 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Feb 25, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- brainless concatenative text to speech☆14May 11, 2021Updated 4 years ago
- KWS demo based on CTC prefix beam search.☆17Oct 21, 2023Updated 2 years ago
- Translating Synthetic RIRs to Real RIRs☆45Sep 15, 2023Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆14Feb 5, 2025Updated last year
- Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?☆19Oct 5, 2022Updated 3 years ago
- A summary of 9 mainstream algorithms practice, including : Logistic Regression / Decision Tree / Random Forest / Adaboost / SVM / Cluste…☆13Dec 19, 2018Updated 7 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- NSNet2 Deep Noise Suppression (DNS) package☆39Sep 12, 2022Updated 3 years ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆18May 17, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆10May 5, 2025Updated 11 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 5 months ago
- ☆18May 27, 2025Updated 10 months ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- ☆14Jul 24, 2025Updated 8 months ago
- The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss☆14Sep 4, 2023Updated 2 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- ☆11Feb 7, 2015Updated 11 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- Big Impulse Response Dataset☆156Oct 19, 2022Updated 3 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 3 years ago
- ☆12Nov 17, 2023Updated 2 years ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Jun 30, 2023Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- ☆14Aug 16, 2023Updated 2 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆27Mar 13, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆60Jun 3, 2024Updated last year
- combine ASR, LLM and TTS in local development with python☆17Sep 21, 2024Updated last year
- [USENIX Security 2025] SafeSpeech: Robust and Universal Voice Protection Against Malicious Speech Synthesis☆32May 24, 2025Updated 10 months ago
- NNSE (Neural Network Speech Enhancement) is a speech-denoiser optimized to run on Ambiq's low power platform☆44Nov 13, 2025Updated 5 months ago
- Test Framework for few-shot open set KWS☆42Nov 8, 2024Updated last year
- This is the code for the WASPAA 2021 paper "Blind Room Parameter Estimation Using Multiple Multichannel Speech Recordings☆17Nov 9, 2022Updated 3 years ago