This repository contains the code for our upcoming paper "An Investigation of End-to-End Models for Robust Speech Recognition" at ICASSP 2021.
☆49Dec 25, 2024Updated last year
Alternatives and similar repositories for Robust-E2E-ASR
Users that are interested in Robust-E2E-ASR are comparing it to the libraries listed below
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43May 23, 2023Updated 2 years ago
- PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"☆19Jul 19, 2019Updated 6 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆16Oct 22, 2022Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆18Dec 1, 2024Updated last year
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Jun 27, 2020Updated 5 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Jul 26, 2021Updated 4 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆20Apr 1, 2022Updated 3 years ago
- ☆74Apr 4, 2024Updated last year
- ☆14Nov 26, 2024Updated last year
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- A CSRankings-like index for speech researchers☆35Oct 16, 2024Updated last year
- J-Net is aimed at audio separation with a randomly weighted encoder.☆12Oct 23, 2019Updated 6 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 4 years ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- [ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆56Oct 9, 2020Updated 5 years ago
- ☆17Mar 30, 2023Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- End-to-end speech recognition implementation; includes LAS, CTC, and RNNT decoding, with SA (MHA), LSTM, CNN, DFSMN, and other models☆15Jun 4, 2021Updated 4 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Nov 8, 2023Updated 2 years ago
- Explores different ways to combine speech models (wav2vec2, hubert) with NLP models (BART, T5, GPT)☆46Jul 3, 2025Updated 8 months ago
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…☆25Jul 6, 2017Updated 8 years ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆32Mar 14, 2025Updated 11 months ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- ☆18Mar 13, 2024Updated last year
- A toolkit for the Spoken Language Understanding Evaluation (SLUE) benchmark. Refer to the paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated 2 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Dec 5, 2022Updated 3 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆25Apr 12, 2024Updated last year
- Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)☆17Nov 22, 2020Updated 5 years ago