channelCS / Audio-Vision
Implementations and reviews of audio and computer vision papers in Python, using Keras and TensorFlow.
☆40Updated 6 years ago
Alternatives and similar repositories for Audio-Vision:
Users who are interested in Audio-Vision are comparing it to the libraries listed below.
- Training neural audio classifiers with few data: https://arxiv.org/abs/1810.10274 ☆60Updated 6 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 5 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 5 years ago
- A TensorFlow implementation of the Griffin-Lim algorithm☆78Updated 6 years ago
- ISMIR2016: Melody extraction on vocal segments using multi-column deep neural networks☆19Updated 7 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 6 years ago
- SiSEC MUS 2018 Submission System☆43Updated 5 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 7 years ago
- Code accompanying the paper "Semi-supervised adversarial audio source separation applied to singing voice extraction"☆83Updated 6 years ago
- Repository containing experiments with Extreme Learning Machines and Reservoir Computing (ELMARC).☆20Updated 6 years ago
- Music genre classification model using CRNN☆68Updated 6 years ago
- Work inspired by the Microsoft Research project on SER using ELM☆19Updated 6 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams☆68Updated 6 years ago
- ☆46Updated 6 years ago
- Kaggle Freesound Audio Tagging 2019 Competition Solution☆28Updated 5 years ago
- Singing-Voice Separation From Monaural Recordings Using Deep Recurrent Neural Networks☆60Updated 6 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆38Updated 7 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 5 years ago
- Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information☆27Updated 7 years ago
- ☆27Updated 6 years ago
- ☆58Updated 6 years ago
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Code for "Comparison and Analysis of SampleCNN Architectures for Audio Classification", IEEE Journal of Selected Topics in Signal Process…☆21Updated 5 years ago
- Learn an L3 embedding from audio/video pairs☆87Updated 2 years ago
- Source Separation for Audio Applications using Online NMF☆13Updated 8 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained TensorFlow models and scikit-learn☆143Updated 2 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 6 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆33Updated 7 years ago