A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LSTM models with those features.
☆55Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for Build-CNN-or-LSTM-or-CNNLSTM-with-speech-features
Users that are interested in Build-CNN-or-LSTM-or-CNNLSTM-with-speech-features are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Depression-Detection represents a machine learning algorithm to classify audio using acoustic features in human speech, thus detecting de…☆14Jul 10, 2020Updated 5 years ago
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Mar 8, 2020Updated 6 years ago
- A repository for emotion recognition from speech, text and mocap data from IEMOCAP dataset☆13Dec 12, 2018Updated 7 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Blind Source Separation: Independent Component Analysis for EEG data with python-MNE package and SSVEP☆12Nov 26, 2018Updated 7 years ago
- Bidirectional LSTM network for speech emotion recognition.☆266Mar 31, 2019Updated 7 years ago
- (tensorflow) Wiener Filter based Speech Enhancement(LSTM/BLSTM, GRU/BGRU, Transformer)☆15Dec 3, 2019Updated 6 years ago
- 用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现☆54May 17, 2019Updated 7 years ago
- ☆18Nov 10, 2019Updated 6 years ago
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…☆12Apr 1, 2019Updated 7 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 6 years ago
- Perform three types of feature extraction: STFT, MFCC and MelSpectrogram. Apply CNN/VGG with or without RNN architecture. Able to achieve…☆15Jun 28, 2020Updated 5 years ago
- In this project, we wish to identify psychiatric disorders through patient's speech☆12Jun 6, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Deep learning using CNN for Mandarin Chinese tone classification☆38Apr 5, 2019Updated 7 years ago
- ☆10May 22, 2023Updated 3 years ago
- Matlab tools for pathological voice analysis☆14May 12, 2023Updated 3 years ago
- Sound event detection with depthwise separable and dilated convolutions.☆53Mar 30, 2020Updated 6 years ago
- Masked ConditionaL Neural Networks☆15Jul 6, 2023Updated 2 years ago
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- Web app created to collect audios for course project☆10Apr 6, 2018Updated 8 years ago
- simple energy vad☆19Jun 3, 2017Updated 9 years ago
- [Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet)☆31Nov 22, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- assignments for e6870 ASR class☆42Apr 23, 2019Updated 7 years ago
- Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a …☆256Mar 3, 2023Updated 3 years ago
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆18Oct 26, 2021Updated 4 years ago
- (Theano) Implementations about deep neural network, recurrent neural network, LSTM, and structured learining.☆10Nov 9, 2016Updated 9 years ago
- ICASSP 2021 accepted paper☆20May 20, 2021Updated 5 years ago
- TensorFlow,DCGAN,VAE,LSTM,CNN,Acoustic Scene Classification☆11Jun 5, 2019Updated 7 years ago
- 语音识别 MFCCs特征处理 cnn神经网络☆104Jan 22, 2019Updated 7 years ago
- This repository allows to use kaldi to train an i-vector extractor and extract i-vectors through a python interface.☆11Nov 27, 2017Updated 8 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Sep 22, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A set of speech feature extraction functions for ASR and speaker identification written in matlab.☆43Oct 28, 2016Updated 9 years ago
- Synthesis speech detection based on Breathing-Talking-Silence sounds☆21Sep 3, 2025Updated 9 months ago
- An unofficial train-test split for ShipsEar: An underwater vessel noise database☆26Jul 31, 2024Updated last year
- LI-FPN is an excellent model for depression recognition based on facial expression.☆16Apr 5, 2024Updated 2 years ago
- System for Emotion Detection in given speech data using joint modelling of hand crafted prosody rich features , MFCC features and LSTM ba…☆10Nov 15, 2017Updated 8 years ago
- Introduction to Python Course Lectures and Supportive Articles☆10Mar 18, 2023Updated 3 years ago
- Deep Bi-LSTM, CNN and attention layer classifier for emotion detection from text☆10Mar 16, 2019Updated 7 years ago