jpuigcerver/xer

Compute useful transcription metrics (CER, WER, SER, ...)

☆27
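
For readers unfamiliar with the metrics named above, here is a minimal sketch of how character, word, and sentence error rates are commonly computed from reference/hypothesis pairs using Levenshtein distance. It is not xer's own code or interface; the function names `edit_distance` and `error_rates` are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Sequence


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance: minimum substitutions, insertions and deletions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def error_rates(refs: List[str], hyps: List[str]) -> Dict[str, float]:
    """Corpus-level CER, WER and SER (hypothetical helper, not xer's API)."""
    char_err = char_len = word_err = word_len = sent_err = 0
    for ref, hyp in zip(refs, hyps):
        char_err += edit_distance(ref, hyp)
        char_len += len(ref)
        ref_words, hyp_words = ref.split(), hyp.split()
        word_err += edit_distance(ref_words, hyp_words)
        word_len += len(ref_words)
        sent_err += int(ref != hyp)
    return {
        "cer": char_err / max(char_len, 1),
        "wer": word_err / max(word_len, 1),
        "ser": sent_err / max(len(refs), 1),
    }
```

Errors are summed over the whole corpus before dividing by the total reference length, which is the usual convention; averaging per-sentence rates instead would give different numbers.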

Alternatives and similar repositories for xer

Users who are interested in xer are comparing it to the repositories listed below.

SimengSun / revisit-nplm
☆12 · Sep 1, 2021 · Updated 4 years ago
Beomi / transformers-language-modeling
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
☆23 · May 20, 2021 · Updated 5 years ago
daanzu / wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆23 · Aug 16, 2021 · Updated 4 years ago
tmbdev-talks / icdar2019-readings
☆14 · Apr 18, 2020 · Updated 6 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12 · Jun 5, 2020 · Updated 6 years ago
snunlp / KR-ELECTRA
KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch
☆15 · Feb 13, 2022 · Updated 4 years ago
speech-paper-reading / speech-paper-reading
Repository for speech paper reading
☆33 · Aug 19, 2021 · Updated 4 years ago
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆16 · Mar 13, 2024 · Updated 2 years ago
sooftware / jasper
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
☆32 · Mar 4, 2021 · Updated 5 years ago
ElotlMX / Esquite
Framework for parallel corpora
☆20 · Jul 14, 2026 · Updated last week
go-nlp / bm25
bm25 is a scoring function that helps with information retrieval
☆14 · Sep 17, 2020 · Updated 5 years ago
OrcusCZ / NNAcousticModeling
☆24 · Sep 25, 2018 · Updated 7 years ago
Aratako / CALM-DACVAE
An attempt to reproduce CALM (Continuous Audio Language Models) using DACVAE as the audio VAE.
☆17 · Feb 20, 2026 · Updated 5 months ago
sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆50 · May 19, 2021 · Updated 5 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆74 · Oct 11, 2021 · Updated 4 years ago
lucky-bai / kaggle-speech-recognition
TensorFlow Speech Recognition Challenge (Top 15%)
☆14 · Jan 16, 2018 · Updated 8 years ago
robd003 / sph2pipe
provide SPHERE-formatted output as well as RIFF, AU, AIFF and raw
☆14 · Dec 18, 2021 · Updated 4 years ago
jackandsnow / craw_government_files
Crawl the public files of different governments using Python 3.
☆15 · Aug 29, 2019 · Updated 6 years ago
CMsmartvoice / Unet-TTS
One-shot TTS with Improved Unseen Speaker and Style Transfer
☆37 · Mar 2, 2022 · Updated 4 years ago
jindongwang / EasyEspnet
Making Espnet easier to use
☆54 · Apr 9, 2021 · Updated 5 years ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32 · Apr 8, 2022 · Updated 4 years ago
idiom-bytes / flaskGPT
Wafer-thin FlaskGPT on Vercel.
☆12 · Jun 1, 2023 · Updated 3 years ago
alokprasad / lpctron-tts-cpp
C++ implementation of end-to-end TTS that combines Tacotron 2 and the LPCNet vocoder.
☆32 · Oct 1, 2019 · Updated 6 years ago
dobby-seo / korean-speech-recognition-quartznet
Korean speech recognition with QuartzNet, a quantized model based on Jasper
☆22 · Jul 21, 2021 · Updated 5 years ago
pln-fing-udelar / jojajovai
Jojajovai Guarani-Spanish Parallel Corpus
☆20 · Jul 5, 2022 · Updated 4 years ago
aranciokov / FSMMDA_VideoRetrieval
☆10 · Nov 23, 2023 · Updated 2 years ago
juanmc2005 / SpeakerEmbeddingLossComparison
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…
☆61 · Oct 7, 2020 · Updated 5 years ago
Deepest-Project / FastSpeech
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆54 · Feb 26, 2020 · Updated 6 years ago
PiotrTa / Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.
☆36 · Oct 4, 2019 · Updated 6 years ago
KrishnaDN / BERTphone
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17 · Dec 10, 2020 · Updated 5 years ago
YongWookHa / kor-text-preprocess
Korean text data preprocessing toolkit for NLP
☆18 · Jun 11, 2019 · Updated 7 years ago
zhjohnchan / bert-clip-synesthesia
[Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.
☆14 · Jun 7, 2023 · Updated 3 years ago
aixplain / NoRefER
☆18 · Jun 5, 2026 · Updated last month
sarahjuan / iban
☆14 · Jun 12, 2015 · Updated 11 years ago
SesameAILabs / silentcipher
☆21 · Mar 17, 2025 · Updated last year
georgid / Lyrics2AudioAligner
Lyrics-to-audio alignment system. Initially done using HTK for rapid prototyping
☆14 · Mar 14, 2018 · Updated 8 years ago
NICE-FUTURE / tfidf-cosine-text-recommendation
[Demo] Recommends similar news headlines using TF-IDF vectorization and cosine similarity
☆14 · Mar 2, 2020 · Updated 6 years ago
iamjanvijay / rnnt
An implementation of RNN-Transducer loss in TF-2.0.
☆46 · Jan 7, 2026 · Updated 6 months ago
crazycloud / Handwritten-text-Detection-Detectron2
Handwritten text detection in document images using Detectron2
☆21 · Dec 1, 2021 · Updated 4 years ago