Train your own speech AI model from scratch
☆148Feb 17, 2026Updated 2 weeks ago
Alternatives and similar repositories for tiny-audio
Users that are interested in tiny-audio are comparing it to the libraries listed below
Sorting:
- Mattermost is an open source platform for secure collaboration across the entire software development lifecycle..☆27Oct 20, 2025Updated 4 months ago
- AndroidSubSystem4GNU/Linux☆32Dec 30, 2025Updated 2 months ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Sep 8, 2021Updated 4 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Oct 6, 2023Updated 2 years ago
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- eSNN - Learning similarity measure from data☆12Nov 28, 2019Updated 6 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Multi Face Recognition and Detection☆67Nov 1, 2022Updated 3 years ago
- Colab notebooks for Next-gen Kaldi☆30Oct 12, 2025Updated 4 months ago
- Extract phoneme-level timestamps from speeh audio.☆119Updated this week
- An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models☆12Nov 13, 2021Updated 4 years ago
- Three.js -> TSL -> Raymarching Clouds -> Tornado☆38Nov 29, 2025Updated 3 months ago
- ☆13Oct 27, 2021Updated 4 years ago
- A simple HTTP server that wraps an unofficial free WhatsApp API.☆16Aug 19, 2025Updated 6 months ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 6 months ago
- cut pdf into pieces☆29Dec 20, 2025Updated 2 months ago
- Yet another machine learning-based WAF research☆26Jun 21, 2022Updated 3 years ago
- ☆35Feb 10, 2026Updated 3 weeks ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Lightweight && straight forward command-line tool for searching and downloading exploits from Exploit-DB.☆47Jan 22, 2026Updated last month
- Simple voice activity detection (VAD) algorithm in Python☆15Aug 10, 2023Updated 2 years ago
- ☆15Nov 5, 2021Updated 4 years ago
- ☆40Oct 2, 2025Updated 5 months ago
- Claude Code Skills for .NET developers.☆43Jan 26, 2026Updated last month
- Nextcloud MCP Server: Connect AI assistants to your Nextcloud instance with 34 comprehensive tools for Notes, Calendar, Contacts, Tables,…☆27Jan 15, 2026Updated last month
- noise reduction☆17Jul 3, 2024Updated last year
- ☆32Dec 4, 2022Updated 3 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆25Apr 12, 2024Updated last year
- Simple Web CLI Web UI Interface☆32Feb 1, 2026Updated last month
- Simple Python web server for HTTP request and browser fingerprinting with whitelist and callback functionality.☆25Apr 18, 2023Updated 2 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- ☆37Nov 22, 2025Updated 3 months ago
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆156Oct 20, 2025Updated 4 months ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- ☆21Jul 15, 2024Updated last year