jimmy-ren / lstm_speaker_naming_aaai16
Code to demonstrate multimodal LSTM
☆36 Updated last year
Related projects
Alternatives and complementary repositories for lstm_speaker_naming_aaai16
- Results of VoiceGAN voice transformation. The audio samples are in folder A-AB-ABA/B-BA-BAB ☆50 Updated 5 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16 ☆22 Updated 2 years ago
- AENet: audio feature extraction ☆60 Updated 5 years ago
- Tools for preparing TIMIT for HMM (with HTK) and deep learning (with Theano) methods ☆78 Updated 9 years ago
- Code for experiments with our RNN regularizer, which stochastically forces units to maintain previous values. ☆78 Updated 7 years ago
- Stochastic Adaptive Neural Architecture Search ☆66 Updated 6 years ago
- Siamese neural networks for representation learning using Theano. ☆22 Updated 9 years ago
- DCASE 2016 Baseline system, python implementation ☆51 Updated 7 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification ☆35 Updated 6 years ago
- Training neural networks with back-prop, feedback-alignment and direct feedback-alignment ☆11 Updated 7 years ago
- Faster Deep Neural Networks ☆36 Updated 7 years ago
- A MATLAB simulation of speech recognition based on pattern analysis, Mel Frequency Cepstral Coefficients as extracted feature and Dynamic … ☆9 Updated 9 years ago
- Keras Implementation of "Look, Listen and Learn" Model ☆21 Updated 7 years ago
- Vectorized multimodal LSTM using Matlab and GPU ☆31 Updated 8 years ago
- Convolutional neural networks for sound classification ☆20 Updated 6 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py… ☆33 Updated 6 years ago
- The source code for Temporal Attention-Gated Model. ☆21 Updated 7 years ago
- THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is c… ☆33 Updated 6 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding ☆125 Updated 7 years ago
- Auralisation of learned features in CNN (for audio) ☆42 Updated 7 years ago
- TensorFlow implementation of "SoundNet". ☆145 Updated 6 years ago
- Python wrappers for Kaldi data ☆61 Updated 7 years ago