diego-fustes/asr-rescoring

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/diego-fustes/asr-rescoring)

diego-fustes / asr-rescoring

Rescoring methods for end-to-end Automatic Speech Recognition

☆27

Alternatives and similar repositories for asr-rescoring

Users that are interested in asr-rescoring are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amazon-science / contextual-attention-nlm
View on GitHub
Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.
☆14Jul 25, 2023Updated 3 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
lightning830 / E2E-audio-speech-recognition
View on GitHub
Conformer encoder + Transformer decoder with Hybrid CTC/attention
☆12Nov 11, 2021Updated 4 years ago
NingAnMe / Label-Smoothing-for-CrossEntropyLoss-PyTorch
View on GitHub
add a Arg: label_smoothing for torch.nn.CrossEntropyLoss()
☆14Jan 13, 2021Updated 5 years ago
QuadraV-Speech / AESRC2020
View on GitHub
a deep accent recognition network
☆50Aug 25, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
liutaocode / DiarizationVisualization
View on GitHub
Visualization tools for audio-only and multi-modal speaker diarization dataset
☆13Oct 27, 2023Updated 2 years ago
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
YosukeHiguchi / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆16Jan 20, 2025Updated last year
ZQuang2202 / Zipformer_Lightning
View on GitHub
An upgrade framework for train and validate compare with icefall using Lightning.
☆16Mar 26, 2025Updated last year
kariminf / lang-trans
View on GitHub
Python transliteration library (mostly from non-latin scripts, such as Arabic, Japanese, etc.)
☆20Dec 31, 2018Updated 7 years ago
mailong25 / vietnamese-question-answering
View on GitHub
Vietnamese Question Answering
☆11Sep 17, 2018Updated 7 years ago
carlfm01 / my-speech-datasets
View on GitHub
My public domain speech index
☆13Sep 19, 2019Updated 6 years ago
gengxuelong / wenet_LLM_from_ASLP
View on GitHub
wenet_LLM_from_ASLP
☆15Nov 26, 2024Updated last year
Kirili4ik / QuartzNet-ASR-pytorch
View on GitHub
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.
☆16Nov 5, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
TeaPoly / Conformer-Athena
View on GitHub
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
☆44Nov 2, 2022Updated 3 years ago
jiangqizheng / art
View on GitHub
基于serverless实现的《图片艺术化应用》
☆10Sep 8, 2020Updated 5 years ago
nikhil-vartak / json-to-html-converter
View on GitHub
Converts JSON data to HTML table with collapsible details view for nested objects.
☆14May 1, 2021Updated 5 years ago
speech-paper-reading / speech-paper-reading
View on GitHub
Repository for speech paper reading
☆33Aug 19, 2021Updated 4 years ago
Miamoto / Conformer-NTM
View on GitHub
☆16Nov 9, 2023Updated 2 years ago
ishine / ContextNet
View on GitHub
Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recogni…
☆18Oct 19, 2020Updated 5 years ago
eastonYi / Unsupervised-ASR
View on GitHub
unsupervised ASR (mainly phone classifier) using EODM and GAN
☆12Oct 22, 2020Updated 5 years ago
r-dh / dutch-vl-tts
View on GitHub
Free Dutch voice dataset
☆13Jan 28, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
MayankFawkes / transfer.sh
View on GitHub
Transfer.sh command line program, Now file sharing from the command line is easy.
☆13Feb 28, 2023Updated 3 years ago
nsu-ai-team / russian_g2p_neuro
View on GitHub
Experiments with grapheme2phoneme for Russian based on the artificial neural networks
☆20Apr 1, 2021Updated 5 years ago
jindongwang / EasyEspnet
View on GitHub
Making Espnet easier to use
☆54Apr 9, 2021Updated 5 years ago
SebastianBodza / Orpheus_Distributed_FastAPI
View on GitHub
☆15Mar 30, 2026Updated 3 months ago
JRMeyer / easy-kaldi
View on GitHub
Use your data to create a speech recognition system in Kaldi. Fast.
☆65Jan 2, 2020Updated 6 years ago
Taltt / FNSE-SBGAN
View on GitHub
FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks
☆20May 12, 2025Updated last year
mush42 / mantoq
View on GitHub
Arabic Grapheme-to-Phoneme (G2P) Conversion
☆16Mar 15, 2025Updated last year
TeaPoly / CTC-OptimizedLoss
View on GitHub
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆59Sep 6, 2023Updated 2 years ago
aispeech-lab / w2v-cif-bert
View on GitHub
☆37Jun 28, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
chuachinhon / wav2vec2_transformers
View on GitHub
Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…
☆32Mar 20, 2021Updated 5 years ago
AsoSoft / AsoSoft-Speech-Corpus
View on GitHub
AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition…
☆10Mar 8, 2022Updated 4 years ago
voithru / voice-activity-detection
View on GitHub
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
☆159Oct 26, 2021Updated 4 years ago
sil-ai / tts-singlish
View on GitHub
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.
☆11Jan 11, 2020Updated 6 years ago
evuraan / mintPiper
View on GitHub
Make Linux speak what's on the screen: clearly and securely.
☆35Apr 6, 2024Updated 2 years ago
papercup-open-source / subscale-wavernn
View on GitHub
Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo
☆19Oct 8, 2020Updated 5 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago