idiap/contextual-biasing-on-gpus

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/idiap/contextual-biasing-on-gpus)

idiap / contextual-biasing-on-gpus

Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech 2023.

☆21

Alternatives and similar repositories for contextual-biasing-on-gpus

Users that are interested in contextual-biasing-on-gpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

idiap / pkwrap
View on GitHub
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
☆74Jun 8, 2022Updated 4 years ago
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
glecorve / rnnlm2wfst
View on GitHub
Conversion of recurrent neural network language models to weighted finite state transducers
☆58Jun 1, 2018Updated 8 years ago
jfainberg / lattice_combination
View on GitHub
Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices
☆16Mar 19, 2024Updated 2 years ago
ag1988 / mel-asr
View on GitHub
The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…
☆21Oct 11, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
JuanPZuluaga / accent-recog-slt2022
View on GitHub
Repository for Accent Recognition (Hackathon @SLT2022)
☆43May 12, 2024Updated 2 years ago
tencent-ailab / 3m-asr
View on GitHub
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
☆119Jun 22, 2022Updated 4 years ago
artbataev / end2end
View on GitHub
Losses and decoders for end-to-end ASR and OCR
☆34Oct 30, 2020Updated 5 years ago
jasonppy / PromptingWhisper
View on GitHub
Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation
☆151Jan 16, 2024Updated 2 years ago
hipudding / pytorch-lightning
View on GitHub
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
☆13Feb 20, 2024Updated 2 years ago
german-asr / kaldi-german
View on GitHub
Scripts for training Kaldi for German speech recognition (ASR).
☆27Feb 11, 2021Updated 5 years ago
idiap / icassp-oov-recognition
View on GitHub
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17Nov 28, 2021Updated 4 years ago
TowerYsable / ASR_awesome
View on GitHub
语音识别论文前沿
☆53Jan 8, 2022Updated 4 years ago
YosukeHiguchi / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆16Jan 20, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
DataXujing / ASR-paper
View on GitHub
ASR教程: https://dataxujing.github.io/ASR-paper/
☆26Jul 1, 2024Updated 2 years ago
idiap / bert-text-diarization-atc
View on GitHub
This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)
☆17Dec 1, 2022Updated 3 years ago
kyutai-labs / tts_longeval
View on GitHub
☆30Apr 29, 2026Updated 2 months ago
danpovey / pocolm
View on GitHub
Small language toolkit for creation, interpolation and pruning of ARPA language models
☆92Aug 6, 2022Updated 3 years ago
idiap / zff_vad
View on GitHub
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23Oct 19, 2023Updated 2 years ago
bubaimaji / cmt-mser
View on GitHub
"MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23
☆24Feb 26, 2023Updated 3 years ago
DataXujing / TTS-paper
View on GitHub
🔥 语音合成（TTS）,语音克隆教程: https://dataxujing.github.io/TTS-paper/#/
☆11Oct 29, 2024Updated last year
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
laboroai / LaboroTVSpeech
View on GitHub
☆90Mar 5, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tongshuangwu / llm-crowdsourcing-pipeline
View on GitHub
☆11Jul 6, 2023Updated 3 years ago
maelfabien / EM_GMM_HMM
View on GitHub
Illustrating EM for GMMs and HMMs
☆12May 9, 2020Updated 6 years ago
open-speech / kaldi-io
View on GitHub
c++ Kaldi IO lib (static and dynamic).
☆25Nov 26, 2018Updated 7 years ago
mzboito / IWSLT2022_Tamasheq_data
View on GitHub
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18Nov 30, 2022Updated 3 years ago
snsun / kaldi-decoder-code-reading
View on GitHub
☆33Oct 28, 2022Updated 3 years ago
nlp-waseda / traveling-across-languages
View on GitHub
Official repo and evaluation implementation of KnowRecall and VisRecall
☆10May 22, 2025Updated last year
IMLHF / SpecAugmentPyTorch
View on GitHub
A Pytorch (support batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…
☆11Jul 24, 2024Updated 2 years ago
JRMeyer / easy-kaldi
View on GitHub
Use your data to create a speech recognition system in Kaldi. Fast.
☆65Jan 2, 2020Updated 6 years ago
k2-fsa / kaldi-decoder
View on GitHub
Decoders from Kaldi using OpenFst
☆35Apr 10, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NUS-HPC-AI-Lab / MoST
View on GitHub
MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts
☆33Jan 15, 2026Updated 6 months ago
ditto-tts / ditto-tts.github.io
View on GitHub
Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer
☆38Feb 17, 2025Updated last year
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
k2kobayashi / Shifter
View on GitHub
Pitch shifter using WSOLA and resampling implemented by Python3
☆40Jul 19, 2017Updated 9 years ago
lourson1091 / audiobertscore
View on GitHub
☆15Nov 10, 2025Updated 8 months ago
csukuangfj / kaldifeat
View on GitHub
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…
☆215Jul 10, 2026Updated 2 weeks ago
MingLunHan / CIF-HieraDist
View on GitHub
[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation
☆41Jul 14, 2026Updated last week