cvondrick / soundnet
SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016
☆460Updated 7 years ago
Alternatives and similar repositories for soundnet:
Users that are interested in soundnet are comparing it to the libraries listed below
- TensorFlow implementation of "SoundNet".☆145Updated 6 years ago
- A github repo of the openSMILE feature extraction tool.☆217Updated 3 years ago
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17☆116Updated 8 years ago
- RNN-based generative models for speech.☆611Updated 7 years ago
- Deep Recurrent Neural Networks for Source Separation☆368Updated 3 years ago
- Torch implementation for audio neural style.☆139Updated 8 years ago
- SampleRNN: An Unconditional End-to-End Neural Audio Generation Model☆541Updated 3 years ago
- A library for augmenting annotated audio data☆233Updated 3 years ago
- Singing Voice Separation via Recurrent Inference and Skip-Filtering Connections - PyTorch Implementation. Demo:☆170Updated 6 years ago
- Environmental Sound Classification with Convolutional Neural Networks - paper replication data☆75Updated 7 years ago
- Speech Enhancement Generative Adversarial Network in TensorFlow☆832Updated last year
- Deep Convolutional Neural Networks for Musical Source Separation☆473Updated 5 years ago
- Spoken language identification with deep learning☆233Updated 7 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 7 years ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆662Updated 6 years ago
- Fetch and use Google's AudioSet dataset☆125Updated 7 years ago
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verification☆782Updated 5 years ago
- Keras implementation of deepmind's wavenet paper☆413Updated 5 years ago
- TensorFlow implementation for audio neural style.☆448Updated 2 years ago
- Speech Recognition using DeepSpeech2 network and the CTC activation function.☆259Updated 7 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆309Updated 6 years ago
- Tensorflow implementation of the models used in "End-to-end learning for music audio tagging at scale"☆150Updated 5 years ago
- Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features☆221Updated 5 years ago
- Speech Recognition Using Tacotron☆163Updated 7 years ago
- End-to-End Attention-Based Large Vocabulary Speech Recognition☆262Updated 2 years ago
- Music auto-tagging models and trained weights in keras/theano☆609Updated 6 years ago
- Transfer learning for music classification and regression tasks☆256Updated 5 years ago
- A Pytorch Implementation of ClariNet☆292Updated 5 years ago
- A convolutional neural network that classifies sounds☆159Updated 8 years ago
- PyTorch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model☆292Updated last year