pika-online / AESRC2020View external linksLinks
a deep accent recognition network
☆49Aug 25, 2021Updated 4 years ago
Alternatives and similar repositories for AESRC2020
Users that are interested in AESRC2020 are comparing it to the libraries listed below
Sorting:
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆56Oct 9, 2020Updated 5 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Sep 23, 2020Updated 5 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Jun 27, 2020Updated 5 years ago
- Example python scripts to evaluate various ASR methods☆11Dec 22, 2021Updated 4 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- English ASR Challenge organized by Speech Lab, IIT Madras☆11Feb 3, 2021Updated 5 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…☆10Dec 25, 2019Updated 6 years ago
- A deep learning model is developed which can predict the native country on the basis of the spoken english accent☆51Feb 24, 2020Updated 5 years ago
- This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)☆16Dec 1, 2022Updated 3 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model☆13Nov 25, 2019Updated 6 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Apr 2, 2019Updated 6 years ago
- Prototype German Computer-Assisted Pronunciation Training tool for lexical stress errors☆12Oct 28, 2015Updated 10 years ago
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- ☆65Jan 6, 2023Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024☆20Jul 27, 2024Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆20Nov 19, 2024Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆24Apr 12, 2024Updated last year
- This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)☆41Jul 10, 2024Updated last year
- ☆21Jul 15, 2024Updated last year
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Mar 7, 2021Updated 4 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago
- ☆40Aug 15, 2021Updated 4 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆46Jul 3, 2025Updated 7 months ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Nov 8, 2023Updated 2 years ago
- keras+bi-lstm+crf,中文命名实体识别☆17Sep 15, 2018Updated 7 years ago
- Download and create a tfreader for the audioset dataset☆16Apr 16, 2020Updated 5 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion☆20Jul 9, 2019Updated 6 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆20Apr 1, 2022Updated 3 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …☆24Jan 9, 2024Updated 2 years ago
- Instructions on downloading and using the LibriAdapt dataset☆46Aug 13, 2021Updated 4 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- End-to-End Mispronunciation Detection via wav2vec2.0☆51Dec 7, 2021Updated 4 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago