symblai / speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
☆37Updated 3 years ago
Alternatives and similar repositories for speech-recognition-evaluation
Users that are interested in speech-recognition-evaluation are comparing it to the libraries listed below
Sorting:
- An online speech recognition extension toolkit of Kaldi☆56Updated 3 years ago
- ☆39Updated last year
- Various speech datasets made available to the public☆118Updated 5 months ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆18Updated 3 years ago
- ☆36Updated 2 weeks ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 4 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- ☆43Updated 2 years ago
- BurrMill core☆21Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 3 months ago
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.☆167Updated 2 weeks ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆144Updated last year
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 4 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆59Updated 4 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆71Updated 8 months ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆104Updated 10 months ago
- MeetEval - A meeting transcription evaluation toolkit☆96Updated this week
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated last year
- Online streaming speaker change detection model in Pytorch☆39Updated 2 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- Multistream CNN for Robust Acoustic Modeling☆40Updated 3 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆135Updated last year
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- Voice Activity Detection (VAD) using deep learning.☆196Updated 5 years ago