hyyoka / Acoustic-Features
audio/speech feature extraction using parselmouth, librosa, disvoice
☆9 · Updated 2 years ago
Alternatives and similar repositories for Acoustic-Features:
Users who are interested in Acoustic-Features are comparing it to the libraries listed below:
- ☆62 · Updated 4 months ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass… ☆23 · Updated 11 months ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆50 · Updated 2 years ago
- Phoneme segmentation using pre-trained speech models ☆54 · Updated 2 years ago
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth ☆38 · Updated 3 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor… ☆20 · Updated last year
- Official Implementation of Mockingjay in Pytorch ☆53 · Updated last year
- Script to perform statistical significance test between ASR hypotheses. ☆21 · Updated 7 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks ☆36 · Updated last year
- ☆21 · Updated 6 months ago
- Repository for reproducing the results of the journal article "Self-supervised learning for Speech Emotion Recognition" ☆10 · Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022). ☆127 · Updated 2 years ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition ☆32 · Updated 6 months ago
- Keras-based python framework to compute phonological posterior probabilities from audio files ☆38 · Updated 2 years ago
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022) ☆14 · Updated 2 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch. ☆27 · Updated 8 months ago
- A unified dataset of multilingual emotional human utterances ☆24 · Updated 3 years ago
- ☆48 · Updated 3 years ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters! ☆33 · Updated last year
- Transformer implementation specialized in speech recognition tasks using Pytorch. ☆65 · Updated 3 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques ☆57 · Updated 3 years ago
- Making Espnet easier to use ☆53 · Updated 3 years ago
- SpeechFormer++ in PyTorch ☆44 · Updated last year
- Streaming Audiotransformers for online Audio tagging ☆43 · Updated 7 months ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆13 · Updated 2 years ago
- Clustering-based methods for overlapping diarization ☆74 · Updated last year
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi… ☆37 · Updated 5 months ago
- End-to-End Mispronunciation Detection via wav2vec2.0 ☆43 · Updated 3 years ago
- Workflow for forced alignment between languages ☆17 · Updated 11 months ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech ☆16 · Updated last year