Open Source AI Benchmarking toolkit for benchmarking speech to text services
☆58Apr 17, 2024Updated last year
Alternatives and similar repositories for benchmarkstt
Users that are interested in benchmarkstt are comparing it to the libraries listed below
Sorting:
- Maintenance and development of the EBUCorePlus☆27Dec 15, 2025Updated 3 months ago
- ebucore maintenance☆25Jan 30, 2026Updated last month
- node version of stt-align https://github.com/bbc/stt-align by Chris Baume - R&D.☆13Jul 18, 2023Updated 2 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- Crawling and creating a German language model resource☆18Aug 23, 2022Updated 3 years ago
- IPA Phonetic dataset lexicon☆18Mar 14, 2026Updated last week
- Subtitling Conversion Framework☆58Nov 16, 2020Updated 5 years ago
- Working towards a free acoustic model for the automatic recognition of New Zealand English☆19Aug 17, 2012Updated 13 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15May 19, 2020Updated 5 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Aug 6, 2019Updated 6 years ago
- MCMA libraries for Node.js☆11Mar 9, 2026Updated last week
- SubER - Subtitle Edit Rate☆24Feb 19, 2026Updated last month
- ☆14Jun 12, 2015Updated 10 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation☆21Mar 29, 2019Updated 6 years ago
- These are preset files for FFMPEG for conversion of a video to MP4☆18Nov 1, 2012Updated 13 years ago
- Unofficial implementation of music separation model by Luo et.al.☆13Nov 3, 2019Updated 6 years ago
- This is a POC of a video server which uses WebRTC to publish a videostream and HTML/CSS/JS for graphik overlays☆15Apr 20, 2020Updated 5 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 10 years ago
- Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).☆283Aug 15, 2023Updated 2 years ago
- Deepspeech ASR Model for the Catalan Language☆17Feb 15, 2021Updated 5 years ago
- 🎲 Woodoku-based reinforcement learning environment using Gymnasium☆10Sep 28, 2024Updated last year
- ☆21Aug 29, 2019Updated 6 years ago
- Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?☆19Oct 5, 2022Updated 3 years ago
- Introductory course to SQL☆11Oct 26, 2018Updated 7 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆62Jan 14, 2026Updated 2 months ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Media Fragments URI is a W3C specification with the objective to provide for media-format independent, standard means of addressing media…☆45Jul 31, 2016Updated 9 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- A random forest classifier to predict the age-group and gender of a speaker from voice measurements.☆18Apr 30, 2019Updated 6 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- SMTPE Timecode conversion☆17Mar 1, 2023Updated 3 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- ☆20Nov 3, 2021Updated 4 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- Example implementations using the MCMA Node.js libraries☆17Apr 9, 2022Updated 3 years ago
- A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress☆613Feb 12, 2024Updated 2 years ago