pseeth / soundnet_keras
SoundNet, built in Keras with pre-trained 8-layer model.
☆29Updated 5 years ago
Alternatives and similar repositories for soundnet_keras:
Users who are interested in soundnet_keras compare it to the libraries listed below:
- Training neural audio classifiers with few data - https://arxiv.org/abs/1810.10274 ☆60Updated 5 years ago
- ☆58Updated 6 years ago
- SiSEC MUS 2018 Submission System☆43Updated 5 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 5 years ago
- 4th position solution to the MediaEval - The 2019 Emotion and Themes in Music using Jamendo☆14Updated 5 years ago
- Pytorch Implementation of FFTNet☆86Updated 6 years ago
- Learn an L3 embedding from audio/video pairs☆87Updated 2 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- A fast cnn-based vocoder☆78Updated 4 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 6 years ago
- ☆56Updated 6 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆33Updated 7 years ago
- Keras Implementation and Experiments with Deep Recurrent Neural Networks for Source Separation☆19Updated 6 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆92Updated 6 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆34Updated 6 years ago
- Network specification and demo☆35Updated 7 years ago
- ☆19Updated 6 years ago
- A pytorch implementation of FFTNet.☆36Updated 6 years ago
- Zero-shot Learning for Audio-based Music Classification and Tagging (ISMIR 2019)☆40Updated 5 years ago
- Multiple Instance Learning for Sound Event Detection☆34Updated 6 years ago
- ☆41Updated 4 months ago
- FFTNet vocoder implementation☆81Updated 6 years ago
- Util code, issues, discussions☆28Updated 6 years ago
- Non-Parallel Voice Conversion with Cyclic Variational Autoencoder☆52Updated 4 years ago
- Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data☆10Updated 6 years ago
- DCASE2019 Challenge Task 1 baseline system☆20Updated 5 years ago