ebu / benchmarkstt
Open Source AI Benchmarking toolkit for benchmarking speech to text services
☆55Updated 11 months ago
Alternatives and similar repositories for benchmarkstt:
Users that are interested in benchmarkstt are comparing it to the libraries listed below
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Audiobook alignment for Indigenous languages☆39Updated 3 weeks ago
- Python library for handling audio datasets.☆136Updated last year
- Language data store and linguistic query API☆39Updated this week
- ☆13Updated last year
- Evaluate results from ASR/Speech-to-Text quickly☆36Updated 3 years ago
- Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora☆28Updated 2 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 4 years ago
- Crawling and creating a German language model resource☆19Updated 2 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆37Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- ☆39Updated last week
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Updated 3 years ago
- Command line tool to create corpora for Common Voice☆75Updated 9 months ago
- Gamma Agreement in Python☆43Updated last year
- 🙊 software for creating speech recognition models.☆158Updated 9 months ago
- Linguistic processing for Common Voice☆53Updated last year
- Python module for syllabifying English ARPABET transcriptions☆66Updated 6 years ago
- The EMU-webApp is an online and offline web application for labeling, visualizing and correcting speech and derived speech data.☆51Updated 6 months ago
- Labeled data for homograph disambiguation☆56Updated last year
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆105Updated last month
- Unicode Standard tokenization routines and orthography profile segmentation☆35Updated last month
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 4 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆173Updated last year
- Scripts for training Kaldi for German speech recognition (ASR).☆24Updated 4 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Updated 3 years ago
- Automatic prosodic annotation tool written in Java.☆60Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆64Updated 4 years ago