jim-schwoebel / voiceomeLinks
🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances, 80+ health labels). Preprint: https://www.medrxiv.org/content/10.1101/2021.08.16.21262125v1
☆30Updated 6 months ago
Alternatives and similar repositories for voiceome
Users that are interested in voiceome are comparing it to the libraries listed below
Sorting:
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated 2 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 6 years ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).☆43Updated 3 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 4 years ago
- A Python toolbox for speech features extraction☆165Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ☆40Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Updated 2 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated last year
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated 2 years ago
- ☆23Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Grapheme to phoneme model for PyTorch☆41Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- asr2k☆52Updated last year
- ☆56Updated 2 years ago
- ABX and kaldi experiments on speech corpora made easy☆33Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated last year
- ☆33Updated 3 years ago
- ☆32Updated 3 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- ☆21Updated 7 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 6 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago