On-device speech-to-text engine powered by deep learning
☆472Updated this week
Alternatives and similar repositories for leopard
Users that are interested in leopard are comparing it to the libraries listed below
Sorting:
- On-device streaming speech-to-text engine powered by deep learning☆658Feb 13, 2026Updated 2 weeks ago
- On-device voice activity detection (VAD) powered by deep learning☆244Feb 13, 2026Updated 2 weeks ago
- benchmark for Speech-to-Intent engines☆17Dec 18, 2025Updated 2 months ago
- On-device Speech-to-Intent engine powered by deep learning☆698Feb 13, 2026Updated 2 weeks ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- On-device voice assistant platform powered by deep learning☆684Apr 11, 2025Updated 10 months ago
- A library for real-time voice processing in web browsers☆238Feb 22, 2026Updated last week
- On-device streaming text-to-speech engine powered by deep learning☆131Updated this week
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- On-device wake word detection powered by deep learning☆4,700Feb 13, 2026Updated 2 weeks ago
- Picovoice Browser Extension☆15Feb 9, 2026Updated 2 weeks ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- ☆12Mar 18, 2022Updated 3 years ago
- speech to text benchmark framework☆680Jan 15, 2026Updated last month
- Gloss3D - 3D Modeler for Linux and Windows☆35May 13, 2025Updated 9 months ago
- Silero Models: pre-trained text-to-speech models made embarrassingly simple☆5,793Feb 3, 2026Updated 3 weeks ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.☆2,567Mar 11, 2024Updated last year
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Mar 30, 2020Updated 5 years ago
- Command-line tools for speech and intent recognition on Linux☆1,107Mar 7, 2024Updated last year
- utt is the universal text transformer☆451Oct 7, 2024Updated last year
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node☆14,301Feb 22, 2026Updated last week
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…☆26,727Jun 19, 2025Updated 8 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Sep 21, 2025Updated 5 months ago
- ANIL(A Nice Intermediate Language) Python & C++ inspired programming language that transpiles to C and can be embedded within C source fi…☆63Updated this week
- On-device speaker recognition engine powered by deep learning☆41Feb 13, 2026Updated 2 weeks ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆130Mar 31, 2021Updated 4 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Jul 26, 2021Updated 4 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Scripts for training Kaldi for German speech recognition (ASR).☆27Feb 11, 2021Updated 5 years ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆19Mar 15, 2020Updated 5 years ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- Vim Speech Recognition Experiments☆20May 30, 2025Updated 9 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Feb 9, 2026Updated 3 weeks ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode…☆914Jan 2, 2026Updated 2 months ago