Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆11Jun 9, 2026Updated last week
Alternatives and similar repositories for long-context-asr
Users that are interested in long-context-asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jun 13, 2022Updated 4 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆78Jan 9, 2025Updated last year
- ☆10Jul 24, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆27Jul 15, 2024Updated last year
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆16Nov 19, 2024Updated last year
- Korean Streaming ASR(with Denoiser and Conformer CTC)☆45Apr 28, 2024Updated 2 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated last year
- 2023年iThome鐵人賽「AI & Data」組佳作【30天內成為NLP大師:掌握關鍵工具和技巧】完整程式碼,該文章會從零開始教你該如何微調大型語言模型☆18Nov 21, 2024Updated last year
- More than Just Words: Modeling Non-textual Characteristics of Podcasts☆26Nov 6, 2019Updated 6 years ago
- A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support☆12Feb 15, 2026Updated 4 months ago
- Official code of ElasticAST (Interspeech 2024 paper)☆34Jul 30, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆46Jul 15, 2022Updated 3 years ago
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆28May 20, 2025Updated last year
- Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/19…☆14Apr 16, 2020Updated 6 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆13Mar 15, 2025Updated last year
- Python package for the extraction of speech features for sustained phonation☆12Aug 10, 2020Updated 5 years ago
- ☆48Jan 8, 2021Updated 5 years ago
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…☆28Nov 23, 2023Updated 2 years ago
- ☆15Mar 15, 2022Updated 4 years ago
- ☆13Sep 13, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…☆21Oct 11, 2024Updated last year
- Speedup the attention computation of Swin Transformer☆32Jun 14, 2025Updated last year
- ☆11May 5, 2022Updated 4 years ago
- Chatbot using NLTK & Keras Deep Learning☆13May 19, 2020Updated 6 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- ☆12Jun 5, 2024Updated 2 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- Whisper Speech Quality Assessment (WhiSQA)☆16Apr 14, 2026Updated 2 months ago
- ☆20Jun 3, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Jul 25, 2023Updated 2 years ago
- A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, ect...) on a GPU. It makes use of auto-di…☆10Aug 28, 2018Updated 7 years ago
- Visually-informed Music Source Separation project at Jeju 2018 Deep Learning Summer Camp☆30Sep 14, 2018Updated 7 years ago
- Various speech datasets made available to the public☆135May 29, 2026Updated 3 weeks ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Aug 6, 2023Updated 2 years ago
- Implementation of True Online TD(lambda) with a Fourier Basis function approximator.☆13May 9, 2015Updated 11 years ago
- This model's weights are converted from Flownet of Nvidia☆12Jun 25, 2019Updated 6 years ago