A set of scripts that extracts speech features (so far MFCCs, FBANKs, STFT, and a rough dominant-frequency estimate) and trains CNN, LSTM, or CNN+LSTM models on those features.
☆54 · Mar 24, 2023 · Updated 3 years ago
Alternatives and similar repositories for Build-CNN-or-LSTM-or-CNNLSTM-with-speech-features
Users that are interested in Build-CNN-or-LSTM-or-CNNLSTM-with-speech-features are comparing it to the libraries listed below.
- Depression-Detection represents a machine learning algorithm to classify audio using acoustic features in human speech, thus detecting de… ☆14 · Jul 10, 2020 · Updated 5 years ago
- Calculate MFCC/Fbank feature for wav files ☆15 · Nov 21, 2017 · Updated 8 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps… ☆28 · Mar 8, 2020 · Updated 6 years ago
- A repository for emotion recognition from speech, text and mocap data from IEMOCAP dataset ☆13 · Dec 12, 2018 · Updated 7 years ago
- Speechflow for emotion recognition related information decomposition ☆10 · Jul 27, 2021 · Updated 4 years ago
- Blind Source Separation: Independent Component Analysis for EEG data with python-MNE package and SSVEP ☆12 · Nov 26, 2018 · Updated 7 years ago
- Bidirectional LSTM network for speech emotion recognition. ☆267 · Mar 31, 2019 · Updated 7 years ago
- (tensorflow) Wiener Filter based Speech Enhancement (LSTM/BLSTM, GRU/BGRU, Transformer) ☆15 · Dec 3, 2019 · Updated 6 years ago
- ☆18 · Nov 10, 2019 · Updated 6 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi ☆67 · Nov 28, 2018 · Updated 7 years ago
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use… ☆12 · Apr 1, 2019 · Updated 7 years ago
- Inspired by the convolutional recurrent neural network (CRNN) and inception, we propose a multiscale time-frequency convolutional recurren… ☆23 · Apr 15, 2020 · Updated 6 years ago
- Perform three types of feature extraction: STFT, MFCC and MelSpectrogram. Apply CNN/VGG with or without RNN architecture. Able to achieve… ☆15 · Jun 28, 2020 · Updated 5 years ago
- In this project, we wish to identify psychiatric disorders through a patient's speech ☆12 · Jun 6, 2021 · Updated 4 years ago
- Deep learning using CNN for Mandarin Chinese tone classification ☆38 · Apr 5, 2019 · Updated 7 years ago
- ☆10 · May 22, 2023 · Updated 2 years ago
- Matlab tools for pathological voice analysis ☆14 · May 12, 2023 · Updated 2 years ago
- Sound event detection with depthwise separable and dilated convolutions. ☆53 · Mar 30, 2020 · Updated 6 years ago
- ☆22 · Jul 28, 2018 · Updated 7 years ago
- Masked ConditionaL Neural Networks ☆15 · Jul 6, 2023 · Updated 2 years ago
- ASR for dysarthric speakers with Kaldi ☆13 · Jan 14, 2017 · Updated 9 years ago
- Web app created to collect audios for course project ☆10 · Apr 6, 2018 · Updated 8 years ago
- [Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet) ☆31 · Nov 22, 2022 · Updated 3 years ago
- VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The… ☆10 · Apr 17, 2020 · Updated 6 years ago
- Front-end speech processing aims at extracting proper features from short-term segments of a speech utterance, known as frames. It is a … ☆256 · Mar 3, 2023 · Updated 3 years ago
- Supervised Speech Representation Learning for Parkinson's Disease Classification ☆17 · Oct 26, 2021 · Updated 4 years ago
- Simple acoustic feature extraction using the Librosa audio processing library and the openSMILE toolkit. ☆217 · May 26, 2020 · Updated 5 years ago
- ICASSP 2021 accepted paper ☆20 · May 20, 2021 · Updated 4 years ago
- TensorFlow, DCGAN, VAE, LSTM, CNN, Acoustic Scene Classification ☆11 · Jun 5, 2019 · Updated 6 years ago
- Speech recognition, MFCC feature processing, CNN neural network ☆105 · Jan 22, 2019 · Updated 7 years ago
- This repository lets you use Kaldi to train an i-vector extractor and extract i-vectors through a Python interface. ☆11 · Nov 27, 2017 · Updated 8 years ago
- Detect Depression with AI Sub-challenge (DSS) of AVEC2019 experiment version via YZK ☆15 · May 28, 2021 · Updated 4 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech ☆17 · Sep 22, 2023 · Updated 2 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab. ☆43 · Oct 28, 2016 · Updated 9 years ago
- NetEase Cloud Music (Flutter) ☆12 · Oct 13, 2022 · Updated 3 years ago
- A pipeline covering dataset gathering, data annotation, model training, and model evaluation for viseme (visual sound phoneme) classification ☆14 · Jan 19, 2021 · Updated 5 years ago
- LI-FPN is an excellent model for depression recognition based on facial expression. ☆16 · Apr 5, 2024 · Updated 2 years ago
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the… ☆11 · Jul 24, 2024 · Updated last year
- System for Emotion Detection in given speech data using joint modelling of hand-crafted prosody-rich features, MFCC features and LSTM ba… ☆10 · Nov 15, 2017 · Updated 8 years ago