hyyoka / Acoustic-Features
Audio/speech feature extraction using parselmouth, librosa, and disvoice
☆10Updated 3 years ago
Alternatives and similar repositories for Acoustic-Features
Users who are interested in Acoustic-Features are comparing it to the repositories listed below
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆34Updated last year
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆42Updated 4 years ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆134Updated 4 years ago
- ICSD Dataset☆40Updated 7 months ago
- ☆70Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆52Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Updated last year
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆24Updated 2 years ago
- ☆40Updated 3 years ago
- Keyword spotting and forced alignment in any language☆82Updated 4 months ago
- Making Espnet easier to use☆54Updated 4 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated last year
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆133Updated last month
- Official Implementation of Mockingjay in Pytorch☆55Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆49Updated 4 years ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆55Updated 2 years ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆43Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33Updated 3 years ago
- ☆52Updated 4 years ago
- ☆22Updated last year
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆24Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆63Updated 4 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆157Updated 3 years ago
- Official implementation of SpeechFormer, written in Python (PyTorch).☆79Updated 2 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21Updated 7 months ago
- ☆54Updated 7 months ago
- Extract frequency, power, width and dissonance of formants from wav files☆28Updated 3 years ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆29Updated 6 years ago
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Updated last year