cvondrick / soundnet
SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016
☆460Updated 7 years ago
Alternatives and similar repositories for soundnet:
Users that are interested in soundnet are comparing it to the libraries listed below
- TensorFlow implementation of "SoundNet".☆145Updated 6 years ago
- RNN-based generative models for speech.☆611Updated 7 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 7 years ago
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17☆116Updated 7 years ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆658Updated 6 years ago
- Music auto-tagging models and trained weights in keras/theano☆611Updated 6 years ago
- A github repo of the openSMILE feature extraction tool.☆213Updated 3 years ago
- Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features☆220Updated 5 years ago
- A library for augmenting annotated audio data☆232Updated 3 years ago
- Deep Recurrent Neural Networks for Source Separation☆367Updated 3 years ago
- Singing Voice Separation via Recurrent Inference and Skip-Filtering Connections - PyTorch Implementation. Demo:☆170Updated 6 years ago
- SampleRNN: An Unconditional End-to-End Neural Audio Generation Model☆538Updated 3 years ago
- Speech Enhancement Generative Adversarial Network in TensorFlow☆829Updated last year
- Transfer learning for music classification and regression tasks☆257Updated 5 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆309Updated 6 years ago
- Deep Learning-based Voice Conversion system☆120Updated 2 years ago
- Torch implementation for audio neural style.☆141Updated 7 years ago
- Audio Classifier in Keras using Convolutional Neural Network☆160Updated 5 years ago
- A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆206Updated 6 years ago
- End-to-End Attention-Based Large Vocabulary Speech Recognition☆261Updated 2 years ago
- ☆223Updated 4 years ago
- Keras implementation of deepmind's wavenet paper☆413Updated 5 years ago
- Spectrograms, MFCCs, and Inversion Demo in a jupyter notebook☆165Updated 5 years ago
- A Cooperative Voice Analysis Repository for Speech Technologies☆355Updated 4 years ago
- This is a TensorFlow implementation of the WaveNet generative neural network architecture https://deepmind.com/blog/wavenet-generative-mo…☆152Updated 6 years ago
- A UNIVERSAL MUSIC TRANSLATION NETWORK - a method for translating music across musical instruments and styles.☆461Updated 3 years ago
- Problem Agnostic Speech Encoder☆440Updated last year
- Deep Convolutional Neural Networks for Musical Source Separation☆474Updated 4 years ago
- This is a TensorFlow implementation of the WaveNet generative neural network architecture https://deepmind.com/blog/wavenet-generative-mo…☆343Updated 8 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆109Updated 6 years ago