batikim09 / keras_sgan_ser
A speech emotion recognition project using Keras-based Semi-Generative Adversarial Networks.
☆11Updated 7 years ago
Alternatives and similar repositories for keras_sgan_ser
Users that are interested in keras_sgan_ser are comparing it to the libraries listed below
- Work inspired by the Microsoft Research project on SER using ELM☆19Updated 7 years ago
- CTC for emotion recognition☆61Updated 8 years ago
- Cross-lingual Voice Conversion☆97Updated 7 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated last year
- Tensorflow implementation of the speech model described in Neural Discrete Representation Learning (a.k.a. VQ-VAE)☆128Updated 7 years ago
- This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders…☆53Updated 7 years ago
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf☆43Updated 7 years ago
- Orkis-Research / Quaternion-Convolutional-Neural-Networks-for-End-to-End-Automatic-Speech-Recognition: This is the code for the paper 'Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition'. It provides all th…☆66Updated 6 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 5 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- ☆30Updated 6 years ago
- Time Delayed NN implemented in pytorch☆81Updated 8 years ago
- ☆45Updated 6 years ago
- transformer for ASR-system (via tensorflow2.0)☆114Updated 6 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tensorflow for speaker recognition.☆36Updated 5 years ago
- ☆54Updated 7 years ago
- ☆27Updated 7 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆98Updated 6 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 2 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- A Tensorflow Implementation of VQ-VAE Speaker Conversion☆42Updated 7 years ago
- Re-implementation of the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder☆148Updated 6 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 7 years ago
- This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable R…☆155Updated 7 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 6 years ago
- SEGAN pytorch implementation https://arxiv.org/abs/1703.09452☆110Updated 6 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆14Updated 9 years ago
- Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks☆64Updated 6 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 7 years ago