k-farruh / speech-accent-detection
The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines accent based audio record. The result of the model could be used to determine accents and help decrease accents to English learning students and improve accents by training.
☆60Updated 3 years ago
Alternatives and similar repositories for speech-accent-detection:
Users that are interested in speech-accent-detection are comparing it to the libraries listed below
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆157Updated 3 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆199Updated 2 years ago
- SelfRemaster: SSL Speech Restoration☆88Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆98Updated last month
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 8 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆82Updated last year
- ☆66Updated 3 months ago
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆156Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆120Updated 2 years ago
- ☆112Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆146Updated 10 months ago
- Monotonic Alignment Search☆90Updated 2 years ago
- Various speech datasets made available to the public☆114Updated 3 months ago
- Predicts the level of noise and reverberation on your audiofiles☆148Updated 10 months ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆127Updated last year
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆142Updated last year
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆84Updated 2 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆27Updated last year
- Collection of pretrained models for the Montreal Forced Aligner☆136Updated 8 months ago
- Repository for Accent Recognition (Hackathon @SLT2022)☆25Updated 10 months ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆116Updated last year
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆110Updated 2 years ago
- Multilingual G2P in 100 languages☆311Updated last year
- ☆140Updated last year