LexicalStressDetection / lexical-stress-detectionLinks
Deep Learning model for lexical stress detection in spoken English
☆29Updated 5 years ago
Alternatives and similar repositories for lexical-stress-detection
Users that are interested in lexical-stress-detection are comparing it to the libraries listed below
Sorting:
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated 2 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 6 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆56Updated 8 months ago
- ☆27Updated 4 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆34Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated 2 years ago
- This project is about performing Speaker diarization for Hindi Language.☆58Updated 4 years ago
- Tools to create your own voice dataset for TTS training☆70Updated 5 years ago
- Unofficial Keras implementation of Google AI VoiceFilter☆43Updated 2 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆90Updated 9 months ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆52Updated 3 years ago
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆37Updated last year
- ☆70Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆93Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆68Updated 4 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- Spot the conversation: speaker diarisation in the wild☆157Updated 3 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆62Updated 2 weeks ago
- Extract frequency, power, width and dissonance of formants from wav files☆28Updated 3 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆71Updated 4 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆220Updated 2 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆52Updated 4 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆109Updated 3 years ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆64Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆113Updated last month
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆196Updated 2 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆265Updated last year