cvondrick / soundnet
SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016
☆458 · Updated 6 years ago
Related projects:
- TensorFlow implementation of "SoundNet". ☆145 · Updated 6 years ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events. ☆644 · Updated 6 years ago
- RNN-based generative models for speech. ☆611 · Updated 7 years ago
- Speech Enhancement Generative Adversarial Network in TensorFlow ☆811 · Updated last year
- End-to-End Attention-Based Large Vocabulary Speech Recognition ☆260 · Updated last year
- Deep Recurrent Neural Networks for Source Separation ☆365 · Updated 3 years ago
- Singing Voice Separation via Recurrent Inference and Skip-Filtering Connections - PyTorch Implementation. Demo: ☆172 · Updated 5 years ago
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17 ☆116 · Updated 7 years ago
- SampleRNN: An Unconditional End-to-End Neural Audio Generation Model ☆533 · Updated 2 years ago
- A library for augmenting annotated audio data ☆231 · Updated 3 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow ☆307 · Updated 5 years ago
- A GitHub repo of the openSMILE feature extraction tool. ☆214 · Updated 2 years ago
- Spoken language identification with deep learning ☆233 · Updated 6 years ago
- Neural net code for lexicon-free speech recognition with connectionist temporal classification ☆248 · Updated 8 years ago
- The official repository of the Eesen project ☆202 · Updated 8 years ago
- A PyTorch Implementation of ClariNet ☆288 · Updated 5 years ago
- ☆221 · Updated 4 years ago
- Speech Recognition Using Tacotron ☆164 · Updated 7 years ago
- CTC + TensorFlow Example for ASR ☆313 · Updated 6 years ago
- Keras implementation of DeepMind's WaveNet paper ☆414 · Updated 5 years ago
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verification ☆779 · Updated 4 years ago
- Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features ☆220 · Updated 5 years ago
- Transfer learning for music classification and regression tasks ☆254 · Updated 4 years ago
- Deep Convolutional Neural Networks for Musical Source Separation ☆469 · Updated 4 years ago
- Spectrograms, MFCCs, and Inversion Demo in a Jupyter notebook ☆164 · Updated 5 years ago
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC) ☆515 · Updated 3 years ago
- PyTorch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model ☆285 · Updated last year
- Application of Connectionist Temporal Classification (CTC) for Speech Recognition (TensorFlow 1.0 but compatible with 2.0). ☆130 · Updated 3 years ago
- Music auto-tagging models and trained weights in Keras/Theano ☆614 · Updated 6 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding ☆124 · Updated 7 years ago