cvondrick / soundnet
SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016
☆461Updated 7 years ago
Alternatives and similar repositories for soundnet
Users that are interested in soundnet are comparing it to the libraries listed below
Sorting:
- TensorFlow implementation of "SoundNet".☆145Updated 7 years ago
- End-to-End Attention-Based Large Vocabulary Speech Recognition☆262Updated 2 years ago
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17☆116Updated 8 years ago
- Singing Voice Separation via Recurrent Inference and Skip-Filtering Connections - PyTorch Implementation. Demo:☆171Updated 6 years ago
- Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features☆220Updated 5 years ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆669Updated 6 years ago
- Speech Recognition using DeepSpeech2 network and the CTC activation function.☆259Updated 7 years ago
- The official repository of the Eesen project☆201Updated 8 years ago
- A library for augmenting annotated audio data☆234Updated 4 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆310Updated 6 years ago
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verification☆785Updated 5 years ago
- Environmental Sound Classification with Convolutional Neural Networks - paper replication data☆75Updated 7 years ago
- Deep Recurrent Neural Networks for Source Separation☆368Updated 3 years ago
- RNN-based generative models for speech.☆611Updated 7 years ago
- Spoken language identification with deep learning☆232Updated 7 years ago
- Speech Recognition Using Tacotron☆163Updated 7 years ago
- Torch implementation for audio neural style.☆140Updated 8 years ago
- A Pytorch Implementation of ClariNet☆292Updated 5 years ago
- Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).☆130Updated 4 years ago
- DeepSpeech neon implementation☆223Updated 2 years ago
- Fetch and use Google's AudioSet dataset☆126Updated 8 years ago
- Deep Voice Real-time Neural TTS System☆160Updated 8 years ago
- Implementation of Google's Tacotron in TensorFlow☆236Updated 7 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 7 years ago
- A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆205Updated 6 years ago
- A github repo of the openSMILE feature extraction tool.☆217Updated 3 years ago
- Deep Learning experiments for audio classification☆150Updated 7 years ago
- Audio Classifier in Keras using Convolutional Neural Network☆160Updated 6 years ago
- Tensorflow implementation of the models used in "End-to-end learning for music audio tagging at scale"☆150Updated 5 years ago
- Speech recognition software where the neural net is trained with TensorFlow and GMM training and decoding is done in Kaldi☆170Updated 8 years ago