akshanshchaudhry / Speech-Accent-Recognition
A deep learning model that predicts a speaker's native country from their spoken English accent
☆49Updated 5 years ago
Alternatives and similar repositories for Speech-Accent-Recognition
Users that are interested in Speech-Accent-Recognition are comparing it to the libraries listed below
- Spoken Language assessment☆44Updated 4 years ago
- Repository for Accent Recognition (Hackathon @SLT2022)☆32Updated last year
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Source code of the paper "End-to-End Language Diarization for Bilingual Code-switching Speech"☆18Updated 3 years ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆27Updated last year
- Speaker diarization Python system based on binary key speaker modelling☆60Updated 3 years ago
- Code for AccentDB.☆22Updated 4 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆12Updated 5 years ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆44Updated 4 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆102Updated 4 months ago
- A Python toolbox for speech features extraction☆163Updated 2 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"☆21Updated 3 years ago
- Various speech datasets made available to the public☆122Updated 6 months ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆16Updated 5 years ago
- Feature extractor for DL speech processing.☆65Updated 3 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- A deep learning model for classifying audio frames into [SPEECH, KCHI, CHI, MAL, FEM] classes.☆44Updated last week
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆61Updated 3 years ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 2 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago
- a deep accent recognition network☆48Updated 3 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago