cvondrick / soundnet
SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016
☆460Updated 7 years ago
Alternatives and similar repositories for soundnet:
Users that are interested in soundnet are comparing it to the libraries listed below
- TensorFlow implementation of "SoundNet".☆145Updated 6 years ago
- Singing Voice Separation via Recurrent Inference and Skip-Filtering Connections - PyTorch Implementation. Demo:☆170Updated 6 years ago
- A library for augmenting annotated audio data☆233Updated 3 years ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆664Updated 6 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆309Updated 6 years ago
- RNN-based generative models for speech.☆611Updated 7 years ago
- Spoken language identification with deep learning☆233Updated 7 years ago
- Torch implementation for audio neural style.☆139Updated 8 years ago
- End-to-End Attention-Based Large Vocabulary Speech Recognition☆262Updated 2 years ago
- The code for the MaD TwinNet. Demo page:☆111Updated 2 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 7 years ago
- Implementation of Google's Tacotron in TensorFlow☆236Updated 6 years ago
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)☆516Updated 4 years ago
- Speech Enhancement Generative Adversarial Network in TensorFlow☆832Updated last year
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17☆116Updated 8 years ago
- SampleRNN: An Unconditional End-to-End Neural Audio Generation Model☆541Updated 3 years ago
- Speech Recognition using DeepSpeech2 network and the CTC activation function.☆259Updated 7 years ago
- Speech Recognition Using Tacotron☆163Updated 7 years ago
- Transfer learning for music classification and regression tasks☆256Updated 5 years ago
- End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)☆315Updated 7 years ago
- CTC + Tensorflow Example for ASR☆311Updated 6 years ago
- The official repository of the Eesen project☆201Updated 8 years ago
- A method to generate speech across multiple speakers☆872Updated 6 years ago
- Keras implementation of deepmind's wavenet paper☆413Updated 5 years ago
- Neural net code for lexicon-free speech recognition with connectionist temporal classification☆248Updated 9 years ago
- DeepSpeech neon implementation☆223Updated 2 years ago
- Tensorflow implementation of the models used in "End-to-end learning for music audio tagging at scale"☆150Updated 5 years ago
- A short example of training a bidirectional LSTM model with connectionist temporal classification☆154Updated 6 years ago
- A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"☆491Updated 5 years ago
- A WaveNet-based vocoder for fast inference☆162Updated 6 years ago