robflynnyh/long-context-asr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/robflynnyh/long-context-asr)

robflynnyh / long-context-asr

Code for the paper: How Much Context Does My Attention-Based ASR System Need?

☆11

Alternatives and similar repositories for long-context-asr

Users that are interested in long-context-asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
desh2608 / pytorch-tdnn
View on GitHub
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Dec 18, 2020Updated 5 years ago
homink / kaldi-asr.forced_decoding
View on GitHub
Perform the forced decoding with target transcription
☆11Sep 12, 2018Updated 7 years ago
MingLunHan / CIF-PyTorch
View on GitHub
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…
☆78Jul 14, 2026Updated 2 weeks ago
kjw11 / CSEnet-ASR
View on GitHub
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12Mar 14, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
atosystem / SSL_Interface
View on GitHub
Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024
☆16Nov 19, 2024Updated last year
SUNGBEOMCHOI / Korean-Streaming-ASR
View on GitHub
Korean Streaming ASR(with Denoiser and Conformer CTC)
☆45Apr 28, 2024Updated 2 years ago
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
alecokas / BiLatticeRNN-Confidence
View on GitHub
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/19…
☆14Apr 16, 2020Updated 6 years ago
EmanElrefai / Islamic-chatbot
View on GitHub
Chatbot using NLTK & Keras Deep Learning
☆13May 19, 2020Updated 6 years ago
AUSTIN2526 / iThome2023-learn-NLP-in-30-days
View on GitHub
2023年iThome鐵人賽「AI & Data」組佳作【30天內成為NLP大師：掌握關鍵工具和技巧】完整程式碼，該文章會從零開始教你該如何微調大型語言模型
☆18Nov 21, 2024Updated last year
JiuFengSC / ElasticAST
View on GitHub
Official code of ElasticAST (Interspeech 2024 paper)
☆34Jul 30, 2024Updated last year
ylongqi / podcast-data-modeling
View on GitHub
More than Just Words: Modeling Non-textual Characteristics of Podcasts
☆26Nov 6, 2019Updated 6 years ago
suzuki256 / dog-dataset
View on GitHub
☆47Jul 15, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
WingZLeung / TTDS
View on GitHub
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆13Mar 15, 2025Updated last year
kaiidams / voice100
View on GitHub
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…
☆28Nov 23, 2023Updated 2 years ago
YiwenShaoStephen / pychain_example
View on GitHub
☆48Jan 8, 2021Updated 5 years ago
stepelu / idbm-pytorch
View on GitHub
☆13Sep 13, 2023Updated 2 years ago
ag1988 / mel-asr
View on GitHub
The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…
☆21Oct 11, 2024Updated last year
bootphon / sustained-phonation-features
View on GitHub
Python package for the extraction of speech features for sustained phonation
☆12Aug 10, 2020Updated 5 years ago
kaistmm / FlowAVSE
View on GitHub
☆27Jul 15, 2024Updated 2 years ago
SimengSun / ChapterBreak
View on GitHub
☆12Jun 5, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Veleslavia / vimss
View on GitHub
Visually-informed Music Source Separation project at Jeju 2018 Deep Learning Summer Camp
☆30Sep 14, 2018Updated 7 years ago
elianap / divexplorer
View on GitHub
☆11May 5, 2022Updated 4 years ago
zzpDapeng / Transformer-Transducer
View on GitHub
A streamable speech recognition model with transformer encoders and RNN-T loss
☆11Mar 1, 2021Updated 5 years ago
aispeech-lab / w2v-cif-bert
View on GitHub
☆37Jun 28, 2021Updated 5 years ago
camenduru / nvidia-llm-colab
View on GitHub
☆14Jul 25, 2023Updated 3 years ago
bene-ges / nemo_compatible
View on GitHub
useful things that work with NVIDIA NeMo library
☆14Jan 20, 2024Updated 2 years ago
Sea-Snell / MLLibCpp
View on GitHub
A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, ect...) on a GPU. It makes use of auto-di…
☆10Aug 28, 2018Updated 7 years ago
revdotcom / speech-datasets
View on GitHub
Various speech datasets made available to the public
☆136May 29, 2026Updated 2 months ago
lsj2408 / URPE
View on GitHub
[NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)
☆35Aug 6, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
zsLin177 / CopyNE
View on GitHub
☆20Jun 3, 2024Updated 2 years ago
shoutOutYangJie / FlowNet1.0-using-Keras
View on GitHub
This model's weights are converted from Flownet of Nvidia
☆12Jun 25, 2019Updated 7 years ago
EllaBot / true-online-td-lambda
View on GitHub
Implementation of True Online TD(lambda) with a Fourier Basis function approximator.
☆13May 9, 2015Updated 11 years ago
otnemrasordep / ProgGP
View on GitHub
A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.
☆18Nov 19, 2024Updated last year
lancopku / SACT
View on GitHub
Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018)
☆14Apr 16, 2019Updated 7 years ago
Curt-Park / triton-inference-server-practice
View on GitHub
Archives for Triton Inference Server Practices
☆15Feb 28, 2022Updated 4 years ago
Kirili4ik / QuartzNet-ASR-pytorch
View on GitHub
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.
☆16Nov 5, 2020Updated 5 years ago