channelCS / Audio-Vision
Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.
☆40Updated 6 years ago
Alternatives and similar repositories for Audio-Vision:
Users that are interested in Audio-Vision are comparing it to the libraries listed below
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 7 years ago
- A TensorFlow implementation of Griffin-Lim algorithm☆78Updated 6 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 5 years ago
- SiSEC MUS 2018 Submission System☆43Updated 5 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information☆27Updated 7 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆34Updated 7 years ago
- Code for https://arxiv.org/abs/1712.00254☆16Updated 7 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 5 years ago
- ISMIR2016: Melody extraction on vocal segments using multi-column deep neural networks☆19Updated 7 years ago
- ☆27Updated 6 years ago
- Learn and L3 embedding from audio/video pairs☆87Updated 2 years ago
- Singing-Voice Separation From Monaural Recordings Using Deep Recurrent Neural Networks☆60Updated 6 years ago
- Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data☆10Updated 6 years ago
- Code accompanying the paper "Semi-supervised adversarial audio source separation applied to singing voice extraction"☆83Updated 6 years ago
- DCASE2016 TASK1 Scene Classification☆12Updated 7 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 5 years ago
- Multiple Instance Learning for Sound Event Detection☆34Updated 6 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆38Updated 7 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 5 years ago
- Task 4 Large-scale weakly supervised sound event detection for smart cars☆65Updated 3 years ago
- Parse and process the demixing secrets dataset (DSD100)☆49Updated 6 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 6 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆143Updated 2 years ago
- Deep Convolutional Networks on the Pitch Spiral for Musical Instrument Recognition☆41Updated 8 years ago
- Revisiting Singing Voice Detection : a Quantitative Review and the Future Outlook☆67Updated 2 years ago
- Bag-of-Features Acoustic Event Detection☆14Updated 8 years ago