pilarOG / prosodic-analysisLinks
Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice quality
☆23Updated 6 years ago
Alternatives and similar repositories for prosodic-analysis
Users that are interested in prosodic-analysis are comparing it to the libraries listed below
Sorting:
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆18Updated 2 years ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆133Updated 4 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- a deep accent recognition network☆49Updated 4 years ago
- ☆40Updated 3 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 7 years ago
- A Python toolbox for speech features extraction☆165Updated 2 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- Forced Alignments for Common Voice☆32Updated 5 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 6 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf☆64Updated 2 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Updated 5 years ago
- Urdu Language Speech Emotional Corpus☆46Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆45Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 4 years ago
- Grapheme To Phoneme☆73Updated last year
- Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆79Updated 3 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 7 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆137Updated 4 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Official Implementation of Mockingjay in Pytorch☆55Updated 2 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Updated last year
- A lightweight library to compute Diarization Error Rate (DER).☆62Updated 2 years ago