uzaymacar / simple-speech-featuresLinks
Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.
☆23Updated 5 years ago
Alternatives and similar repositories for simple-speech-features
Users that are interested in simple-speech-features are comparing it to the libraries listed below
Sorting:
- Matlab tools for pathological voice analysis☆13Updated 2 years ago
- A unified dataset of multilingual emotional human utterances☆25Updated 3 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆132Updated 5 months ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆131Updated 3 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆25Updated 10 months ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆46Updated 2 years ago
- Deep Articulatory Synthesis and Inversion☆49Updated last year
- RASTA-PLP and MFCC tool based rasta-mat☆33Updated 2 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆83Updated last year
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆22Updated 3 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- ☆30Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- MSP-Podcast Challenge Baseline Code☆22Updated 11 months ago
- ☆60Updated 4 years ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆15Updated 3 years ago
- Repo associated to the DESED dataset, download and creation of data☆139Updated 10 months ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Updated 5 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆39Updated 11 months ago
- Support for Clarity Enhancement and Prediction Challenges (obsolete - see README)☆48Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 7 months ago
- Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch☆105Updated 4 years ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆24Updated 6 months ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆45Updated 4 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆41Updated 3 years ago