narVidhai / Speech-Transcription-Benchmarking
Example python scripts to evaluate various ASR methods
☆12Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Speech-Transcription-Benchmarking
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 3 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆13Updated 5 months ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆18Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆13Updated last month
- ☆11Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆11Updated 6 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated last year
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 2 years ago
- A simple command line tool to calculate WER for ASR.☆13Updated last month
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆9Updated last year
- ☆10Updated last year
- End-to-end diarization loss☆22Updated 3 years ago
- acnn for text-independent speaker recognition☆9Updated 2 years ago
- ClearVoice☆13Updated this week
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…☆10Updated 4 years ago
- ☆14Updated last year
- ☆11Updated last year
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆16Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- Baseline kaldi script for UA-SPEECH corpus☆29Updated last month
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆9Updated 9 months ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Updated 11 months ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 4 years ago
- Addressing Text-dependent Speaker Verification Using Singing Speech☆9Updated 5 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Updated last month