pseeth / soundnet_kerasLinks
SoundNet, built in Keras with pre-trained 8-layer model.
☆29Updated 5 years ago
Alternatives and similar repositories for soundnet_keras
Users that are interested in soundnet_keras are comparing it to the libraries listed below
Sorting:
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- Learn and L3 embedding from audio/video pairs☆87Updated 3 years ago
- ☆59Updated 7 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated 2 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 6 years ago
- Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.☆40Updated 6 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆79Updated 7 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆74Updated 4 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- A fast cnn-based vocoder☆78Updated 5 years ago
- Pytorch Implementation of FFTNet☆86Updated 7 years ago
- SiSEC MUS 2018 Submission System☆43Updated 6 years ago
- A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation☆138Updated last year
- ☆27Updated 7 years ago
- Code for https://arxiv.org/abs/1712.00254☆16Updated 7 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆102Updated 2 weeks ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆94Updated 7 years ago
- Pytorch and TensorFlow data loaders for several audio datasets☆113Updated 5 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 6 years ago
- Multiple Instance Learning for Sound Event Detection☆34Updated 7 years ago
- Code accompanying ISMIR'19 paper titled "Learning to Traverse Latent Spaces for Musical Score Inpaintning"☆47Updated 4 years ago
- Utils and data sets for audio and PyTorch☆86Updated 3 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 7 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆110Updated last year
- Zero-shot Learning for Audio-based Music Classification and Tagging (ISMIR 2019)☆42Updated 5 years ago
- Baseline systems for the FSD50K dataset☆70Updated 4 years ago
- Network specification and demo☆35Updated 8 years ago