channelCS / Audio-VisionLinks
Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.
☆40Updated 6 years ago
Alternatives and similar repositories for Audio-Vision
Users that are interested in Audio-Vision are comparing it to the libraries listed below
Sorting:
- SiSEC MUS 2018 Submission System☆43Updated 5 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- Singing-Voice Separation From Monaural Recordings Using Deep Recurrent Neural Networks☆62Updated 7 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 5 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 7 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆130Updated 3 years ago
- Code accompanying the paper "Semi-supervised adversarial audio source separation applied to singing voice extraction"☆84Updated 6 years ago
- Learn and L3 embedding from audio/video pairs☆87Updated 3 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆74Updated 4 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 5 years ago
- A TensorFlow implementation of Griffin-Lim algorithm☆79Updated 7 years ago
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Updated 7 years ago
- Convolutional Neural Network for auto-tagging of audio clips on MagnaTagATune dataset☆58Updated 2 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams☆68Updated 6 years ago
- Tensorflow Implementation of Convolutional Recurrent Neural Networks for Music Genre Classification☆55Updated 8 years ago
- Masked ConditionaL Neural Networks☆15Updated 2 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 7 years ago
- Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information☆27Updated 8 years ago
- Source Separation Project For ML Jeju Camp 2017☆48Updated 7 years ago
- ☆27Updated 7 years ago
- ☆59Updated 7 years ago
- Speech Enhancement using Bayesian WaveNet☆96Updated 7 years ago
- Audio Classifier in Keras using Convolutional Neural Network☆160Updated 6 years ago
- Code for https://arxiv.org/abs/1712.00254☆16Updated 7 years ago
- DCASE2016 TASK1 Scene Classification☆12Updated 8 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆109Updated last year
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆101Updated 2 years ago