pilarOG / prosodic-analysisLinks
Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice quality
☆22Updated 6 years ago
Alternatives and similar repositories for prosodic-analysis
Users that are interested in prosodic-analysis are comparing it to the libraries listed below
Sorting:
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- A Python toolbox for speech features extraction☆165Updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 5 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- ☆30Updated 3 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- ☆40Updated 3 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆79Updated 3 years ago
- Official Implementation of Mockingjay in Pytorch☆56Updated 2 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆61Updated 2 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 7 years ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆133Updated 3 years ago
- a deep accent recognition network☆48Updated 4 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆136Updated 3 years ago
- simple textgrid to csv converter☆26Updated 4 years ago
- Grapheme To Phoneme☆73Updated last year
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆17Updated 2 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated 2 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆34Updated 6 years ago
- Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf☆64Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 3 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 4 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆62Updated 4 years ago