narVidhai / Speech-Transcription-Benchmarking
Example python scripts to evaluate various ASR methods
☆12Updated 3 years ago
Alternatives and similar repositories for Speech-Transcription-Benchmarking:
Users that are interested in Speech-Transcription-Benchmarking are comparing it to the libraries listed below
- ☆12Updated 3 weeks ago
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Updated 2 years ago
- A semi-supervised sequence-to-sequence ASR☆10Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- Text frontend for ESPnet tts recipes☆31Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆15Updated 4 months ago
- ☆11Updated 3 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated last year
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Updated 4 months ago
- ☆25Updated 2 years ago
- This is the experimental description of MnTTS2.☆9Updated 10 months ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆17Updated 2 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- Transfer learning approach to pronunciation scoring☆10Updated last year
- ☆17Updated last year
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Simple Kaldi recipe for forced alignment☆10Updated last year
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Updated last year
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆20Updated 2 years ago
- A simple command line tool to calculate WER for ASR.☆14Updated 4 months ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 4 years ago
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆12Updated last month
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- Speechflow for emotion recognition related information decomposition☆10Updated 3 years ago
- Addressing Text-dependent Speaker Verification Using Singing Speech☆9Updated 5 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"☆13Updated 4 years ago