symblai / speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
☆36Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for speech-recognition-evaluation
- ☆34Updated 10 months ago
- An online speech recognition extension toolkit of Kaldi☆57Updated 3 years ago
- Various speech datasets made available to the public☆99Updated last month
- Online streaming speaker change detection model in Pytorch☆36Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆44Updated 4 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆84Updated last month
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.☆157Updated 3 weeks ago
- ☆32Updated 2 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆141Updated 6 months ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆37Updated 2 years ago
- ☆50Updated last year
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆34Updated 4 years ago
- Tunable pipelines☆30Updated last month
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆61Updated 4 months ago
- Clustering-based methods for overlapping diarization☆70Updated 10 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆100Updated last year
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated last year
- Speaker diarization python system based on binary key speaker modelling☆61Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆71Updated last year
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Predicts the level of noise and reverberation on your audiofiles☆138Updated 6 months ago
- MeetEval - A meeting transcription evaluation toolkit☆78Updated 3 weeks ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆67Updated last year
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.☆42Updated 3 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 4 years ago
- ☆40Updated last year
- ☆66Updated last year
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆111Updated 2 years ago