vladimir-chernykh / emotion_recognitionLinks
CTC for emotion recognition
☆60Updated 8 years ago
Alternatives and similar repositories for emotion_recognition
Users that are interested in emotion_recognition are comparing it to the libraries listed below
Sorting:
- Implementation of state of the art d-vector approach for speaker verification☆127Updated 7 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 5 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Updated 7 years ago
- implement end-to-end asr algorithm with tensorflow☆40Updated 6 years ago
- Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network"☆90Updated 4 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 7 years ago
- Keras Interface for Kaldi ASR☆121Updated 7 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 8 years ago
- transformer for ASR-systerm (via tensorflow2.0)☆114Updated 6 years ago
- Feedforward Sequential Memory Networks (FSMN) implemented by tensorflow☆52Updated 8 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆32Updated 6 years ago
- Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure☆89Updated 7 years ago
- DCASE 2018 Baseline systems☆129Updated 5 years ago
- Share some recent speaker recognition papers and their implementations.☆90Updated 5 years ago
- ☆99Updated 7 years ago
- Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).☆130Updated 4 years ago
- Voice Activity Detector☆73Updated 2 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab.☆43Updated 8 years ago
- ☆40Updated 9 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆122Updated 5 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 8 years ago
- ☆130Updated 6 years ago
- ASR for Chinese Mandarin☆75Updated 7 years ago
- DCASE 2017 Baseline system☆82Updated 5 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆66Updated 6 years ago
- System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models☆21Updated 7 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆109Updated 6 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- An LDA/PLDA estimator using KALDI in python for speaker verification tasks☆100Updated 8 years ago