pseeth / soundnet_keras
SoundNet, built in Keras with pre-trained 8-layer model.
☆29Updated 5 years ago
Alternatives and similar repositories for soundnet_keras
Users that are interested in soundnet_keras are comparing it to the libraries listed below
Sorting:
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 5 years ago
- ☆59Updated 7 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- A fast cnn-based vocoder☆78Updated 4 years ago
- ☆56Updated 6 years ago
- Multiple Instance Learning for Sound Event Detection☆34Updated 7 years ago
- wavenet vocoder using tensorflow☆26Updated 7 years ago
- Learn and L3 embedding from audio/video pairs☆87Updated 3 years ago
- Network specification and demo☆35Updated 7 years ago
- ☆27Updated 7 years ago
- Zero-shot Learning for Audio-based Music Classification and Tagging (ISMIR 2019)☆41Updated 5 years ago
- SiSEC MUS 2018 Submission System☆43Updated 5 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 7 years ago
- pytorch implementation of lyre.ai's char2wav model☆32Updated 7 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 6 years ago
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf☆43Updated 7 years ago
- Task 4 Large-scale weakly supervised sound event detection for smart cars☆65Updated 3 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆92Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 3 years ago
- ☆19Updated 7 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- ☆21Updated 7 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆34Updated 7 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Updated 4 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago