k-farruh / speech-accent-detection
Everyone speaks a language with an accent, and a particular accent reflects the speaker's linguistic background. The model identifies an accent from an audio recording. Its output could be used to detect accents and to help students learning English reduce or improve their accents through training.
☆62Updated 4 years ago
Alternatives and similar repositories for speech-accent-detection
Users that are interested in speech-accent-detection are comparing it to the libraries listed below
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated 2 months ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated 2 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆55Updated 5 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- ☆67Updated 5 months ago
- A curated list of awesome voice activity detection☆68Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆91Updated 2 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆261Updated last year
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆90Updated 7 months ago
- Python forced alignment☆94Updated last year
- Various speech datasets made available to the public☆129Updated 11 months ago
- SelfRemaster: SSL Speech Restoration☆93Updated last year
- Collection of pretrained models for the Montreal Forced Aligner☆177Updated last month
- A non-native English corpus for pronunciation scoring task☆159Updated 3 weeks ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆67Updated 3 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands datasets V1 and V2.☆108Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Updated 2 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Updated 3 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆172Updated 2 years ago
- ☆69Updated last month
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆263Updated 10 months ago
- Finetuning VITS Efficiently☆33Updated 2 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆147Updated 5 months ago
- Putting flows on top of neural transducers for better TTS☆64Updated last week
- Predicts the level of noise and reverberation on your audiofiles☆169Updated 5 months ago
- This project is about performing Speaker diarization for Hindi Language.☆54Updated 4 years ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆151Updated last year