batikim09 / keras_sgan_ser
This is a speech emotion recognition project using Keras-based Semi-Generative Adversarial Networks.
☆11 · Updated 7 years ago
Alternatives and similar repositories for keras_sgan_ser
Users interested in keras_sgan_ser are comparing it to the repositories listed below.
- Work inspired by the Microsoft Research project on SER using ELM ☆19 · Updated 7 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi… ☆28 · Updated 5 years ago
- Orkis-Research / Quaternion-Convolutional-Neural-Networks-for-End-to-End-Automatic-Speech-Recognition: This is the code for the paper 'Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition'. It provides all th… ☆66 · Updated 6 years ago
- This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders… ☆53 · Updated 7 years ago
- SEGAN pytorch implementation https://arxiv.org/abs/1703.09452 ☆110 · Updated 6 years ago
- Task 4 Large-scale weakly supervised sound event detection for smart cars ☆67 · Updated 3 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec… ☆38 · Updated 7 years ago
- ☆27 · Updated 7 years ago
- CTC for emotion recognition ☆61 · Updated 8 years ago
- DCASE2019 Challenge Task 1 baseline system ☆20 · Updated 5 years ago
- Surrey CVSSP DCASE 2018 Task 2 system ☆19 · Updated 2 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition. ☆36 · Updated 5 years ago
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf ☆43 · Updated 7 years ago
- ☆30 · Updated 6 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09… ☆61 · Updated 7 years ago
- Cross-lingual Voice Conversion ☆97 · Updated 7 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels" ☆98 · Updated 6 years ago
- ☆54 · Updated 7 years ago
- Tensorflow implementation of the speech model described in Neural Discrete Representation Learning (a.k.a. VQ-VAE) ☆128 · Updated 7 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification ☆29 · Updated 6 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models ☆39 · Updated last year
- Implementations of vanilla autoencoder, VAE, and GAN in Tensorflow ☆17 · Updated 8 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility. ☆32 · Updated last year
- TensorFlow Implementation of CDVAE-VC. ☆54 · Updated 2 years ago
- This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable R… ☆155 · Updated 7 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data ☆39 · Updated 7 years ago
- Speech Enhancement using Bayesian WaveNet ☆97 · Updated 7 years ago
- Transformer for ASR system (via TensorFlow 2.0) ☆114 · Updated 6 years ago
- Cochlear.ai submission for dcase2018 task2 ☆15 · Updated 6 years ago
- Re-implementation of the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder ☆148 · Updated 6 years ago