ebu/benchmarkstt

Open Source AI Benchmarking toolkit for benchmarking speech to text services

☆59

Alternatives and similar repositories for benchmarkstt

Users that are interested in benchmarkstt are comparing it to the libraries listed below.

bbc / stt-align-node
node version of stt-align https://github.com/bbc/stt-align by Chris Baume - R&D.
☆13 · Jul 18, 2023 · Updated 3 years ago
ebu / ebucore
ebucore maintenance
☆26 · Jan 30, 2026 · Updated 5 months ago
danijel3 / ClarinStudioKaldi
A baseline Automatic Speech Recognition system for Polish based on Kaldi.
☆18 · Dec 21, 2021 · Updated 4 years ago
CoEDL / kaldi_helpers
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
☆15 · May 19, 2020 · Updated 6 years ago
dabinat / deepspeech-tools
Scripts to simplify data prepping for Mozilla DeepSpeech.
☆14 · Aug 6, 2019 · Updated 6 years ago
leichtrhino / ChimeraNet
Unofficial implementation of music separation model by Luo et al.
☆13 · Nov 3, 2019 · Updated 6 years ago
TimotheeMickus / codwoe
The CODWOE shared task invites you to compare two types of semantic descriptions: dictionary glosses and word embedding representations. …
☆12 · Jul 13, 2022 · Updated 4 years ago
ccoreilly / deepspeech-catala
Deepspeech ASR Model for the Catalan Language
☆17 · Feb 15, 2021 · Updated 5 years ago
helpingstar / gym-woodoku
🎲 Woodoku-based reinforcement learning environment using Gymnasium
☆10 · Sep 28, 2024 · Updated last year
goodmike31 / pl-asr-speech-data-survey
Survey of available speech datasets for Polish ASR development
☆17 · Jan 1, 2025 · Updated last year
ruslan-corpus / ruslan-corpus.github.io
☆22 · Aug 29, 2019 · Updated 6 years ago
robmsmt / SpeechLoop
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
☆19 · Oct 5, 2022 · Updated 3 years ago
jacobmontiel / sql-intro
Introductory course to SQL
☆11 · Oct 26, 2018 · Updated 7 years ago
neurlang / dataset
IPA Phonetic dataset lexicon
☆18 · Jun 20, 2026 · Updated last month
apptek / SubER
SubER - Subtitle Edit Rate
☆26 · May 7, 2026 · Updated 2 months ago
dave-fernandes / SpeakerClassifier
A random forest classifier to predict the age-group and gender of a speaker from voice measurements.
☆18 · Apr 30, 2019 · Updated 7 years ago
idnavid / speech_activity_detection
Unsupervised speech activity detection system.
☆11 · Jul 2, 2018 · Updated 8 years ago
wlzhang2020 / LLMTreeRec
The implementation of LLMTreeRec
☆14 · Dec 9, 2024 · Updated last year
Hannes1 / react-native-wenet
Wenet speech to text for react native
☆10 · Nov 1, 2022 · Updated 3 years ago
OlaPietka / Agglomerative-Hierarchical-Clustering-from-scratch
Build Agglomerative hierarchical clustering algorithm from scratch, i.e. WITHOUT any advance libraries such as Numpy, Pandas, Scikit-lear…
☆19 · May 27, 2023 · Updated 3 years ago
kate-egorova / ASR-hybrid-decoding
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11 · Feb 4, 2020 · Updated 6 years ago
MertKalkanci / Highlights-Maker
A video highlights creator
☆12 · Jun 1, 2024 · Updated 2 years ago
kuk / crawl-vk-catalog
☆16 · May 19, 2016 · Updated 10 years ago
douglasbagnall / nze-vox
Working towards a free acoustic model for the automatic recognition of New Zealand English
☆19 · Aug 17, 2012 · Updated 13 years ago
guillemcortes / baf-dataset
Reproducibility kit for "BAF: An Audio Fingerprinting Dataset for Broadcast Monitoring" by Guillem Cortès, Álex Ciurana, Emilio Molina, M…
☆35 · Mar 21, 2023 · Updated 3 years ago
DinoTheDinosaur / russian_g2p_neuro
G2P tool for Russian language with vosk-model-ru styled transcriptions
☆10 · Jun 9, 2021 · Updated 5 years ago
pyannote / pyannote-metrics
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
☆252 · May 19, 2026 · Updated 2 months ago
talhanai / kaldi-diar-latte
steps to perform text-based speaker diarization with kaldi toolkit
☆12 · Nov 2, 2018 · Updated 7 years ago
gladiaio / normalization
A lightweight library for normalizing speech transcripts before computing WER
☆27 · Jul 14, 2026 · Updated last week
watsonbox / sphinxtrain-ruby
Toolkit for training/adapting CMU Sphinx acoustic models
☆17 · May 25, 2018 · Updated 8 years ago
alexnorton / transcript-editor
Timed-transcript editor component built using Draft.js.
☆46 · Aug 20, 2018 · Updated 7 years ago
oTranscribe / oTplayer
Audio (and video) player for oTranscribe
☆28 · May 21, 2016 · Updated 10 years ago
JorenSix / JGaborator
Fast Gabor spectral transforms in Java. Using a JNI bridge with the gaborator C++ library.
☆14 · Jan 20, 2023 · Updated 3 years ago
motazsaad / ara-pronunciation-tool
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15 · Sep 5, 2017 · Updated 8 years ago
symblai / speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
☆41 · Dec 28, 2021 · Updated 4 years ago
uhh-lt / subtitle2go
☆25 · Dec 13, 2023 · Updated 2 years ago
AccelerateNetworks / DeepSpeech_Frontend
A webpage and API for using Mozilla DeepSpeech
☆48 · Feb 24, 2021 · Updated 5 years ago
pietrop / digital-paper-edit-client
Work in progress - digital paper edit project - React Client
☆13 · Jun 25, 2021 · Updated 5 years ago
sarahjuan / iban
☆14 · Jun 12, 2015 · Updated 11 years ago