drammock / spectrogram-tutorial
A walkthrough of how to make spectrograms in Python that are customized for human speech research.
☆39 Updated last year
Alternatives and similar repositories for spectrogram-tutorial:
Users who are interested in spectrogram-tutorial are comparing it to the libraries listed below.
- This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and dec… ☆38 Updated 7 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data ☆78 Updated 7 years ago
- Implementation of the Griffin and Lim algorithm to recover an audio signal from a magnitude-only spectrogram. ☆174 Updated 6 years ago
- Keras-based Python framework to compute phonological posterior probabilities from audio files ☆43 Updated 2 years ago
- TensorFlow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf ☆74 Updated 4 years ago
- An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms. ☆16 Updated 9 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver… ☆89 Updated 4 years ago
- Interspeech 2019 tutorial materials ☆48 Updated 5 years ago
- SoundNet, built in Keras with pre-trained 8-layer model. ☆29 Updated 5 years ago
- Estimate the number of concurrent speakers from single channel mixtures to crack the "cocktail-party" problem. ☆22 Updated 5 years ago
- Deep understanding and modelling of the hierarchical structure of prosody ☆22 Updated 5 years ago
- Collaborative audio module for fast.ai ☆99 Updated 5 years ago
- Deep Learning experiments for audio classification ☆149 Updated 7 years ago
- Spectrograms, MFCCs, and Inversion Demo in a Jupyter notebook ☆166 Updated 5 years ago
- Utils and data sets for audio and PyTorch ☆85 Updated 3 years ago
- Environmental Sound Classification with Convolutional Neural Networks - paper replication data ☆75 Updated 7 years ago
- Unsupervised segmentation and clustering of Buckeye English and NCHLT Xitsonga corpora. ☆9 Updated 8 years ago
- Speech recognition model based on a FAIR research paper, built using PyTorch. ☆83 Updated 6 years ago
- Learning embeddings for laughter categorization ☆34 Updated 6 years ago
- Vocode spectrograms to audio with generative adversarial networks ☆63 Updated 5 years ago
- Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings. ☆221 Updated last year
- Uses a GMM to train a gender detection model. Testing was done on a subset of Google's AudioSet corpus. ☆19 Updated 7 years ago
- How to run GPU accelerated Signal Processing in TensorFlow ☆23 Updated 6 years ago
- Multiple Instance Learning for Sound Event Detection ☆34 Updated 7 years ago
- ABX and Kaldi experiments on speech corpora made easy ☆32 Updated 6 months ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning ☆101 Updated 2 years ago
- Bayesian spEEch Recognizer ☆55 Updated 4 years ago
- Python implementation of pre-processing for End-to-End speech recognition ☆69 Updated 7 years ago
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016) ☆14 Updated 8 years ago