nkrao220 / accent-classification
Accent Classification in Speech
☆24Updated 5 years ago
Related projects:
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 3 years ago
- ☆40Updated last year
- Source code for the paper "End-to-End Language Diarization for Bilingual Code-switching Speech"☆18Updated 2 years ago
- Estimating the age, height, and gender of a speaker from their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆57Updated 3 years ago
- Dataset for the ICASSP 2021 paper "Multilingual Phonetic Dataset for Low Resource Speech Recognition"☆34Updated last year
- ☆16Updated 3 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 3 years ago
- ☆31Updated 2 weeks ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 2 years ago
- Forced Alignments for Common Voice☆29Updated 3 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 2 years ago
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆13Updated 4 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated last year
- Phonetically-Oriented Word Error Rate☆31Updated 5 years ago
- Linguistic processing for Common Voice☆50Updated 8 months ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆45Updated 4 months ago
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"☆17Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- Online streaming speaker change detection model in PyTorch☆34Updated last year
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆24Updated 5 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆63Updated 2 years ago
- ☆38Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆39Updated 2 months ago
- ☆26Updated 2 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆12Updated last year
- A CSRankings-like index for speech researchers☆30Updated last year
- Experimentation platform for training and running inference on wav2vec2 models.☆85Updated last year
- Code for the paper "Speech Recognition and Multi-Speaker Diarization of Long Conversations"☆36Updated last year
- A list of papers for child ASR☆24Updated 5 months ago