jim-schwoebel / voiceome
The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances, 80+ health labels). Preprint: https://www.medrxiv.org/content/10.1101/2021.08.16.21262125v1
★ 29 · Updated 2 months ago
Alternatives and similar repositories for voiceome
Users that are interested in voiceome are comparing it to the libraries listed below
- Implementation of the DIVA model of speech acquisition and production using PyTorch ★ 21 · Updated 2 years ago
- Keras-based Python framework to compute phonological posterior probabilities from audio files ★ 43 · Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ★ 28 · Updated 2 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project… ★ 12 · Updated 3 years ago
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models. ★ 15 · Updated 5 years ago
- PyTorch-based phoneme recognition (TIMIT phoneme classification) ★ 34 · Updated 7 years ago
- ★ 12 · Updated 4 years ago
- A handy dataset of noises for ASR ★ 21 · Updated 6 years ago
- Feature extractor for DL speech processing. ★ 65 · Updated 3 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ★ 25 · Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge ★ 15 · Updated 3 years ago
- ★ 22 · Updated last year
- Python library for audio augmentation ★ 84 · Updated last year
- A speech signal processing library in Python with emphasis on deep learning. ★ 31 · Updated 2 years ago
- ★ 40 · Updated 3 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis ★ 23 · Updated 3 years ago
- Deep Speech Distances PyTorch ★ 29 · Updated 3 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems ★ 19 · Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ★ 27 · Updated 4 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech ★ 17 · Updated last year
- Code for the paper: How Much Context Does My Attention-Based ASR System Need? ★ 10 · Updated last month
- A repository comprising code for generating noisy speech data from clean data using deep learning methods ★ 14 · Updated 3 years ago
- A deep learning model for classifying audio frames into [SPEECH, KCHI, CHI, MAL, FEM] classes. ★ 44 · Updated last week
- A PyTorch 1.0 implementation of the convolutions described in SincNet ★ 33 · Updated 6 years ago
- Text to Speech Synthesis based on controllable latent representation ★ 14 · Updated 5 years ago
- Detect emotion from audio ★ 13 · Updated 6 years ago
- Transformer-based online speech recognition system with TensorFlow 2 ★ 26 · Updated 4 years ago
- Interspeech 2019 tutorial materials ★ 48 · Updated 5 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks". ★ 13 · Updated 2 years ago
- Machine learning speaker characteristics ★ 35 · Updated 2 weeks ago