jpuigcerver / xerLinks
Compute useful transcriptions metrics (CER, WER, SER, ...)
☆27Updated 11 years ago
Alternatives and similar repositories for xer
Users that are interested in xer are comparing it to the libraries listed below
Sorting:
- ☆38Updated 5 years ago
- An implementation of RNN-Transducer loss in TF-2.0.☆46Updated 2 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆49Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Updated 5 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- An advance kaldi wrapper for Pyhton☆38Updated 4 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Updated 3 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated last year
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 6 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆62Updated 2 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- ☆76Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Updated 5 years ago
- neural network based speaker embedder☆25Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 6 years ago
- SpeechYOLO Interspeech 2019☆46Updated 3 years ago
- Data preparation code for building Kaldi ASR system☆14Updated 8 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆79Updated 3 years ago
- Pronunciation-assisted Subword Modeling☆31Updated 6 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 4 years ago
- ☆37Updated 3 weeks ago