pseeth / soundnet_keras
SoundNet, built in Keras with pre-trained 8-layer model.
☆29Updated 5 years ago
Alternatives and similar repositories for soundnet_keras:
Users that are interested in soundnet_keras are comparing it to the libraries listed below
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 5 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- wavenet vocoder using tensorflow☆27Updated 7 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- SiSEC MUS 2018 Submission System☆43Updated 5 years ago
- Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.☆40Updated 6 years ago
- Learn and L3 embedding from audio/video pairs☆87Updated 2 years ago
- ☆58Updated 6 years ago
- ☆19Updated 7 years ago
- ☆27Updated 6 years ago
- A fast cnn-based vocoder☆78Updated 4 years ago
- ☆31Updated 6 years ago
- A Pytorch Implementation of MelGAN☆67Updated 5 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 7 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 5 years ago
- ☆33Updated 5 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆34Updated 7 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago
- Network specification and demo☆35Updated 7 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆107Updated last year
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- A pytorch implementation of FFTNet.☆36Updated 6 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆86Updated 4 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆92Updated 6 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- A TensorFlow implementation of Griffin-Lim algorithm☆78Updated 6 years ago