An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs
☆49Mar 14, 2024Updated last year
Alternatives and similar repositories for Speech-to-Intent-Micro
Users that are interested in Speech-to-Intent-Micro are comparing it to the libraries listed below
Sorting:
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆17Aug 26, 2025Updated 6 months ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆13Feb 5, 2025Updated last year
- brainless concatenative text to speech☆14May 11, 2021Updated 4 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Jun 30, 2023Updated 2 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- ☆14Aug 19, 2024Updated last year
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆18May 17, 2023Updated 2 years ago
- Indic-Conformer models for ASR☆21Jul 19, 2024Updated last year
- Making Espnet easier to use☆54Apr 9, 2021Updated 4 years ago
- ☆18May 27, 2025Updated 9 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- NSNet2 Deep Noise Suppression (DNS) package☆39Sep 12, 2022Updated 3 years ago
- ☆12Nov 17, 2023Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Feb 2, 2026Updated last month
- KWS demo based on CTC prefix beam search.☆17Oct 21, 2023Updated 2 years ago
- ☆14Jul 24, 2025Updated 7 months ago
- ESP32 Android Studio, for detail click README file☆20May 5, 2022Updated 3 years ago
- Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?☆19Oct 5, 2022Updated 3 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 2 years ago
- Python implementation of a few speech intelligibility prediction algorithms☆15May 29, 2024Updated last year
- ☆21Mar 4, 2024Updated 2 years ago
- Translating Synthetic RIRs to Real RIRs☆45Sep 15, 2023Updated 2 years ago
- This is a demo project showing how to fine-tune and deploy the Whisper model on SageMaker.☆25Dec 20, 2023Updated 2 years ago
- RVC Onnx Infer- Upgraded and simplified-ish☆25May 9, 2024Updated last year
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆24Feb 17, 2023Updated 3 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Aug 1, 2023Updated 2 years ago
- NXP Platform Accelerator for i.MXRT595 EVK☆11Jun 30, 2025Updated 8 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- This repository contains text-to-speech (TTS) models and utilities designed produce synthetic training datasets for other speech-related …☆28Mar 12, 2023Updated 2 years ago
- This is a winter of code project aimed at speech enhancement of text to speech models.☆24Feb 6, 2022Updated 4 years ago