pilarOG / prosodic-analysisLinks
Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice quality
☆23Updated 6 years ago
Alternatives and similar repositories for prosodic-analysis
Users that are interested in prosodic-analysis are comparing it to the libraries listed below
Sorting:
- Urdu Language Speech Emotional Corpus☆46Updated 6 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆136Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 5 years ago
- ☆40Updated 3 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 5 years ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆133Updated 4 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- A Python toolbox for speech features extraction☆165Updated 2 years ago
- The python implementation for paper "Towards Discriminative Representation Learning for Speech Emotion Recognition" in IJCAI-2019☆23Updated 6 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 4 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 6 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 4 years ago
- ☆30Updated 3 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 5 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆143Updated 3 years ago
- MSP-Podcast Challenge Baseline Code☆28Updated last year
- Official Implementation of Mockingjay in Pytorch☆55Updated 2 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆34Updated 6 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- PyTorch implementation of RPNSD☆60Updated last year
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 3 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆49Updated 3 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆62Updated 4 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 7 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆33Updated last year