jpuigcerver / xer
Compute useful transcription metrics (CER, WER, SER, ...)
☆27 Updated 11 years ago
Alternatives and similar repositories for xer
Users that are interested in xer are comparing it to the libraries listed below
- An implementation of RNN-Transducer loss in TF-2.0. ☆46 Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆26 Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆65 Updated 5 years ago
- ☆38 Updated 5 years ago
- ☆76 Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec… ☆45 Updated 2 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi… ☆30 Updated 2 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra. ☆49 Updated 4 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems ☆19 Updated 4 years ago
- ☆37 Updated 3 weeks ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi. ☆26 Updated last year
- neural network based speaker embedder ☆25 Updated 2 years ago
- A lightweight library to compute Diarization Error Rate (DER). ☆62 Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆107 Updated 2 years ago
- ☆21 Updated 7 years ago
- Pronunciation-assisted Subword Modeling ☆31 Updated 6 years ago
- A handy dataset of noises for ASR ☆22 Updated 6 years ago
- End to End Dialect Identification using Convolutional Neural Network ☆53 Updated 6 years ago
- Word Error Rate Estimation ☆15 Updated 5 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP… ☆61 Updated 5 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and quickly. ☆16 Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆71 Updated 3 years ago
- Multilingual Grapheme to Phoneme ☆50 Updated 9 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech ☆51 Updated 4 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone … ☆43 Updated 3 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 Updated 5 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Waven… ☆52 Updated 6 years ago
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65 Updated 5 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆74 Updated 5 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM. ☆37 Updated last year