channelCS / Audio-VisionLinks
Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.
☆40Updated 7 years ago
Alternatives and similar repositories for Audio-Vision
Users that are interested in Audio-Vision are comparing it to the libraries listed below
Sorting:
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 6 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆75Updated 4 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated 2 years ago
- Learn and L3 embedding from audio/video pairs☆88Updated 3 years ago
- ☆59Updated 7 years ago
- A TensorFlow implementation of Griffin-Lim algorithm☆79Updated 7 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Updated 8 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆131Updated 4 years ago
- Audio data augmentation examples☆34Updated 7 years ago
- Code accompanying the paper "Semi-supervised adversarial audio source separation applied to singing voice extraction"☆84Updated 6 years ago
- Audio Classifier in Keras using Convolutional Neural Network☆160Updated 6 years ago
- Deep Learning experiments for audio classification☆148Updated 8 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆145Updated 3 years ago
- ☆27Updated 7 years ago
- Environmental Sound Classification with Convolutional Neural Networks - paper replication data☆75Updated 8 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 7 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆82Updated 8 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 7 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆102Updated 3 months ago
- A library for augmenting annotated audio data☆235Updated 4 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆99Updated 6 years ago
- Convolutional neural networks for sound classification☆20Updated 8 years ago
- DCASE 2018 Baseline systems☆130Updated 6 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 6 years ago
- Pytorch Implementation of FFTNet☆87Updated 7 years ago
- DCASE 2017 Baseline system☆82Updated 5 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 7 years ago
- Tensorflow implementation of the models used in "End-to-end learning for music audio tagging at scale"☆151Updated 6 years ago
- Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network"☆90Updated 4 years ago