A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LSTM models with those features.
☆55Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for Build-CNN-or-LSTM-or-CNNLSTM-with-speech-features
Users that are interested in Build-CNN-or-LSTM-or-CNNLSTM-with-speech-features are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Mar 8, 2020Updated 6 years ago
- A repository for emotion recognition from speech, text and mocap data from IEMOCAP dataset☆13Dec 12, 2018Updated 7 years ago
- Blind Source Separation: Independent Component Analysis for EEG data with python-MNE package and SSVEP☆12Nov 26, 2018Updated 7 years ago
- Bidirectional LSTM network for speech emotion recognition.☆266Mar 31, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- (tensorflow) Wiener Filter based Speech Enhancement(LSTM/BLSTM, GRU/BGRU, Transformer)☆15Dec 3, 2019Updated 6 years ago
- ☆18Nov 10, 2019Updated 6 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆67Nov 28, 2018Updated 7 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 6 years ago
- Deep learning using CNN for Mandarin Chinese tone classification☆38Apr 5, 2019Updated 7 years ago
- ☆10May 22, 2023Updated 3 years ago
- Matlab tools for pathological voice analysis☆14May 12, 2023Updated 3 years ago
- Sound event detection with depthwise separable and dilated convolutions.☆53Mar 30, 2020Updated 6 years ago
- ☆22Jul 28, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Masked ConditionaL Neural Networks☆15Jul 6, 2023Updated 2 years ago
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- Web app created to collect audios for course project☆10Apr 6, 2018Updated 8 years ago
- Die Webseite des Chaostreff Potsdam☆11May 21, 2026Updated last week
- Ein Programm zur Beschleunigung von Sprachaufnahmen☆12Apr 16, 2018Updated 8 years ago
- [Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet)☆31Nov 22, 2022Updated 3 years ago
- assignments for e6870 ASR class☆42Apr 23, 2019Updated 7 years ago
- VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The…☆10Apr 17, 2020Updated 6 years ago
- Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a …☆256Mar 3, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆17Oct 26, 2021Updated 4 years ago
- Qt for Python workshop☆11Nov 23, 2021Updated 4 years ago
- (Theano) Implementations about deep neural network, recurrent neural network, LSTM, and structured learining.☆10Nov 9, 2016Updated 9 years ago
- ICASSP 2021 accepted paper☆20May 20, 2021Updated 5 years ago
- TensorFlow,DCGAN,VAE,LSTM,CNN,Acoustic Scene Classification☆11Jun 5, 2019Updated 6 years ago
- This repository allows to use kaldi to train an i-vector extractor and extract i-vectors through a python interface.☆11Nov 27, 2017Updated 8 years ago
- Detect Depression with AI Sub-challenge (DSS) of AVEC2019 experienment version via YZK☆15May 28, 2021Updated 5 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Sep 22, 2023Updated 2 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab.☆43Oct 28, 2016Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Synthesis speech detection based on Breathing-Talking-Silence sounds☆21Sep 3, 2025Updated 8 months ago
- A pipeline from Dataset Gathering,Data annotations, Model training,Model Evaluation for viseme (visual sound phoneme) classification☆15Jan 19, 2021Updated 5 years ago
- LI-FPN is an excellent model for depression recognition based on facial expression.☆16Apr 5, 2024Updated 2 years ago
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the…☆11Jul 24, 2024Updated last year
- some scripts for asvspoof2017☆11Dec 27, 2018Updated 7 years ago
- Introduction to Python Course Lectures and Supportive Articles☆10Mar 18, 2023Updated 3 years ago
- Deep Bi-LSTM, CNN and attention layer classifier for emotion detection from text☆10Mar 16, 2019Updated 7 years ago