pilarOG / prosodic-analysis
Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice quality
☆21Updated 5 years ago
Alternatives and similar repositories for prosodic-analysis:
Users that are interested in prosodic-analysis are comparing it to the libraries listed below
- ☆40Updated 3 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 4 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆41Updated 2 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- End to End Dialect Identification using Convolutional Neural Network☆52Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆16Updated last year
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 9 months ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆43Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆14Updated last year
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- Machine learning speaker characteristics☆33Updated last week
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆39Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- a deep accent recognition network☆48Updated 3 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆28Updated 3 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Updated 5 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 5 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆27Updated last year
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 4 years ago