DanielLin94144 / Test-time-adaptation-ASR-SUTA
Test-time adaptation of a speech recognition model using a single utterance. The official implementation of the paper "Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition".
☆20Apr 1, 2022Updated 3 years ago
Alternatives and similar repositories for Test-time-adaptation-ASR-SUTA
Users that are interested in Test-time-adaptation-ASR-SUTA are comparing it to the libraries listed below
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆45Jan 24, 2026Updated 2 weeks ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 4 years ago
- ☆18Apr 12, 2017Updated 8 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Mar 7, 2021Updated 4 years ago
- Official PyTorch implementation of SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy M…☆37Aug 27, 2024Updated last year
- [MICCAI'22] Test-time Adaptation with Calibration of Medical Image Classification Nets for Label Distribution Shift☆21Oct 28, 2022Updated 3 years ago
- Online streaming speaker change detection model in Pytorch☆44Apr 14, 2023Updated 2 years ago
- ☆30Oct 29, 2024Updated last year
- ☆11Sep 1, 2024Updated last year
- [NeurIPS] TTT++: When Does Self-supervised Test-time Training Fail or Thrive?☆73Feb 14, 2022Updated 3 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 10 months ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Sep 2, 2024Updated last year
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Instructions on downloading and using the LibriAdapt dataset☆46Aug 13, 2021Updated 4 years ago
- Recipe for LibriPhrase☆33Sep 2, 2023Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- ASCEND Chinese-English code-switching dataset☆30Jul 12, 2022Updated 3 years ago
- Code for ICML 2022 paper — Efficient Test-Time Model Adaptation without Forgetting☆135May 19, 2023Updated 2 years ago
- End-to-end speech recognition implementation; includes LAS, CTC, and RNNT decoding, with models such as SA (MHA), LSTM, CNN, and DFSMN☆15Jun 4, 2021Updated 4 years ago
- NMT-based punctuation prediction system using lexical and acoustic features.☆14Mar 30, 2020Updated 5 years ago
- E2E ASR system☆14Oct 20, 2022Updated 3 years ago
- [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks☆15May 18, 2022Updated 3 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 2 years ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- ☆13Feb 5, 2022Updated 4 years ago