On-device speech-to-text engine powered by deep learning
☆477Apr 9, 2026Updated this week
Alternatives and similar repositories for leopard
Users that are interested in leopard are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- On-device streaming speech-to-text engine powered by deep learning☆661Updated this week
- benchmark for Speech-to-Intent engines☆17Mar 27, 2026Updated 2 weeks ago
- On-device noise suppression powered by deep learning☆86Updated this week
- On-device Speech-to-Intent engine powered by deep learning☆698Updated this week
- On-device Speech-to-Index engine powered by deep learning☆36Apr 16, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- On-device voice activity detection (VAD) powered by deep learning☆248Mar 26, 2026Updated 2 weeks ago
- On-device speaker diarization powered by deep learning☆69Updated this week
- On-device voice assistant platform powered by deep learning☆688Apr 11, 2025Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆136Apr 3, 2026Updated last week
- A library for real-time voice processing in web browsers☆241Mar 27, 2026Updated 2 weeks ago
- On-device speaker recognition engine powered by deep learning☆42Updated this week
- On-device wake word detection powered by deep learning☆4,782Updated this week
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- On-device LLM Inference Powered by X-Bit Quantization☆309Mar 27, 2026Updated 2 weeks ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.☆2,580Mar 11, 2024Updated 2 years ago
- Speaker diarization benchmark framework☆40Jan 8, 2026Updated 3 months ago
- Convert words to numbers☆21Apr 13, 2022Updated 4 years ago
- Detect and remove or lower the volume of breathing in speech recordings.☆14May 14, 2025Updated 11 months ago
- Korean ASR Corpus generated from TEDx talks☆27Jan 11, 2019Updated 7 years ago
- Silero Models: pre-trained text-to-speech models made embarrassingly simple☆5,861Mar 27, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTor…☆15Feb 27, 2024Updated 2 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 4 years ago
- Command-line tools for speech and intent recognition on Linux☆1,107Mar 7, 2024Updated 2 years ago
- ☆12Mar 18, 2022Updated 4 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated 10 months ago
- a quick-and-dirty glsl prototyping tool☆26Aug 2, 2020Updated 5 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆19Mar 15, 2020Updated 6 years ago
- Gloss3D - 3D Modeler for Linux and Windows☆35May 13, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 6 months ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node☆14,529Feb 22, 2026Updated last month
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆130Mar 31, 2021Updated 5 years ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Mar 30, 2020Updated 6 years ago