Picovoice/speech-to-text-benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Picovoice/speech-to-text-benchmark)

Picovoice / speech-to-text-benchmark

speech to text benchmark framework

☆696

Alternatives and similar repositories for speech-to-text-benchmark

Users that are interested in speech-to-text-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Picovoice / cheetah
View on GitHub
On-device streaming speech-to-text engine powered by deep learning
☆669Jul 22, 2026Updated last week
Picovoice / wake-word-benchmark
View on GitHub
wake word engine benchmark framework
☆160Jul 18, 2026Updated last week
airbnb / artificial-adversary
View on GitHub
🗣️ Tool to generate adversarial text examples and test machine learning models against them
☆405Jan 7, 2022Updated 4 years ago
NTRLab / MediaSpeech
View on GitHub
☆22Jul 22, 2022Updated 4 years ago
Franck-Dernoncourt / ASR_benchmark
View on GitHub
Program to benchmark various speech recognition APIs
☆82Sep 6, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
flashlight / wav2letter
View on GitHub
Facebook AI Research's Automatic Speech Recognition Toolkit
☆6,439Jul 14, 2026Updated 2 weeks ago
syhw / wer_are_we
View on GitHub
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
☆1,864Jun 27, 2022Updated 4 years ago
mozilla / DeepSpeech
View on GitHub
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…
☆26,771Jun 19, 2025Updated last year
Picovoice / speech-to-intent-benchmark
View on GitHub
benchmark for Speech-to-Intent engines
☆18Updated this week
robmsmt / ASR-Audio-Data-Links
View on GitHub
A list of publically available audio data that anyone can download for ASR or other speech activities
☆237Aug 6, 2021Updated 4 years ago
gooofy / zamia-speech
View on GitHub
Open tools and data for cloudless automatic speech recognition
☆449Mar 30, 2021Updated 5 years ago
viralpoetry / alzheimer-password-generator
View on GitHub
Chrome extension for domain dependent password generation.
☆13Oct 14, 2025Updated 9 months ago
Picovoice / porcupine
View on GitHub
On-device wake word detection powered by deep learning
☆4,892Jul 21, 2026Updated last week
agnusmaximus / Word2Bits
View on GitHub
Quantized word vectors that take 8x-16x less space than regular word vectors
☆753Mar 31, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Picovoice / rhino
View on GitHub
On-device Speech-to-Intent engine powered by deep learning
☆705Updated this week
Picovoice / leopard
View on GitHub
On-device speech-to-text engine powered by deep learning
☆482Updated this week
jsn5 / dancenet
View on GitHub
DanceNet -💃💃Dance generator using Autoencoder, LSTM and Mixture Density Network. (Keras)
☆519Sep 15, 2019Updated 6 years ago
coryshain / dnnseg
View on GitHub
☆11Mar 20, 2021Updated 5 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
howonlee / twostrangethings
View on GitHub
two strange things to do with neural nets
☆15Feb 18, 2019Updated 7 years ago
Picovoice / octopus
View on GitHub
On-device Speech-to-Index engine powered by deep learning
☆36Apr 16, 2025Updated last year
narVidhai / Speech-Transcription-Benchmarking
View on GitHub
Example python scripts to evaluate various ASR methods
☆11Dec 22, 2021Updated 4 years ago
revdotcom / words2num
View on GitHub
Convert words to numbers
☆21Apr 13, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
patyork / AutomaticSpeechChunker
View on GitHub
From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…
☆17May 15, 2015Updated 11 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
alumae / kaldi-gstreamer-server
View on GitHub
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
☆1,094Jun 8, 2024Updated 2 years ago
astorfi / Deep-Learning-Roadmap
View on GitHub
Organized Resources for Deep Learning Researchers and Developers
☆3,185Dec 22, 2022Updated 3 years ago
zzw922cn / Automatic_Speech_Recognition
View on GitHub
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
☆2,834Mar 24, 2023Updated 3 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,903Updated this week
CoEDL / kaldi_helpers
View on GitHub
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
☆15May 19, 2020Updated 6 years ago
bajibabu / GlottGAN
View on GitHub
This repository contains the files used for our Interspeech 2017 paper.
☆16May 30, 2017Updated 9 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
gooofy / py-kaldi-asr
View on GitHub
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
☆169Feb 23, 2021Updated 5 years ago
batikim09 / LIVE_SER
View on GitHub
Live demo for speech emotion recognition using Keras and Tensorflow models
☆39Aug 2, 2024Updated last year
freewym / espresso
View on GitHub
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆939Sep 4, 2024Updated last year
at16k / at16k
View on GitHub
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
☆130Mar 31, 2021Updated 5 years ago
AI4Bharat / NPTEL2020-Indian-English-Speech-Dataset
View on GitHub
NPTEL2020: Speech2Text dataset for Indian-English Accent
☆86Apr 2, 2026Updated 3 months ago
iceychris / LibreASR
View on GitHub
An On-Premises, Streaming Speech Recognition System
☆679Nov 28, 2021Updated 4 years ago
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago