cvondrick / soundnetLinks
SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016
☆462Updated 7 years ago
Alternatives and similar repositories for soundnet
Users that are interested in soundnet are comparing it to the libraries listed below
Sorting:
- TensorFlow implementation of "SoundNet".☆145Updated 7 years ago
- Singing Voice Separation via Recurrent Inference and Skip-Filtering Connections - PyTorch Implementation. Demo:☆171Updated 6 years ago
- RNN-based generative models for speech.☆610Updated 7 years ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆673Updated 7 years ago
- Speech Enhancement Generative Adversarial Network in TensorFlow☆842Updated 2 years ago
- Deep Convolutional Neural Networks for Musical Source Separation☆477Updated 5 years ago
- Deep Recurrent Neural Networks for Source Separation☆369Updated 3 years ago
- Keras implementation of deepmind's wavenet paper☆413Updated 5 years ago
- Torch implementation for audio neural style.☆140Updated 8 years ago
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17☆116Updated 8 years ago
- SampleRNN: An Unconditional End-to-End Neural Audio Generation Model☆542Updated 3 years ago
- End-to-End Attention-Based Large Vocabulary Speech Recognition☆262Updated 2 years ago
- Music auto-tagging models and trained weights in keras/theano☆613Updated 6 years ago
- A library for augmenting annotated audio data☆234Updated 4 years ago
- Fetch and use Google's AudioSet dataset☆126Updated 8 years ago
- Deep Learning experiments for audio classification☆149Updated 7 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 7 years ago
- Speech Recognition using DeepSpeech2 network and the CTC activation function.☆259Updated 8 years ago
- TensorFlow implementation for audio neural style.☆451Updated 3 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆310Updated 6 years ago
- CTC + Tensorflow Example for ASR☆312Updated 7 years ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆388Updated 6 years ago
- Speech Recognition Using Tacotron☆163Updated 7 years ago
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verification☆786Updated 5 years ago
- Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features☆220Updated 5 years ago
- Spoken language identification with deep learning☆232Updated 7 years ago
- A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆205Updated 6 years ago
- A Pytorch Implementation of ClariNet☆292Updated 5 years ago
- PyTorch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model☆292Updated 2 years ago
- End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)☆314Updated 7 years ago