jim-schwoebel / voiceome
🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances, 80+ health labels). Preprint: https://www.medrxiv.org/content/10.1101/2021.08.16.21262125v1
☆28Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for voiceome
- Keras-based python framework to compute phonological posterior probabilities from audio files☆37Updated last year
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated last year
- ☆32Updated 2 years ago
- A library of speech gadgets.☆13Updated 2 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆30Updated 5 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated last year
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆24Updated 4 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- ☆22Updated last year
- ☆40Updated 2 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆34Updated 6 years ago
- Viterbi decoding in PyTorch☆26Updated last month
- ☆22Updated 3 years ago
- Feature extractor for DL speech processing.☆65Updated 2 years ago
- ☆12Updated 3 years ago
- Prosodic Speech Segmentation with Transformers☆23Updated 8 months ago
- Official Implementation of Mockingjay in Pytorch☆52Updated last year
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- ☆19Updated 6 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆38Updated 4 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- ☆10Updated last year
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 3 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- ☆24Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago