A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LSTM models with those features.
☆55Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for Build-CNN-or-LSTM-or-CNNLSTM-with-speech-features
Users that are interested in Build-CNN-or-LSTM-or-CNNLSTM-with-speech-features are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Depression-Detection represents a machine learning algorithm to classify audio using acoustic features in human speech, thus detecting de…☆14Jul 10, 2020Updated 5 years ago
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Mar 8, 2020Updated 6 years ago
- Blind Source Separation: Independent Component Analysis for EEG data with python-MNE package and SSVEP☆12Nov 26, 2018Updated 7 years ago
- Bidirectional LSTM network for speech emotion recognition.☆267Mar 31, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- (tensorflow) Wiener Filter based Speech Enhancement(LSTM/BLSTM, GRU/BGRU, Transformer)☆15Dec 3, 2019Updated 6 years ago
- 用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现☆53May 17, 2019Updated 6 years ago
- ☆18Nov 10, 2019Updated 6 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆67Nov 28, 2018Updated 7 years ago
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…☆12Apr 1, 2019Updated 7 years ago
- ☆10May 22, 2023Updated 2 years ago
- Matlab tools for pathological voice analysis☆14May 12, 2023Updated 2 years ago
- Sound event detection with depthwise separable and dilated convolutions.☆53Mar 30, 2020Updated 6 years ago
- ☆22Jul 28, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Masked ConditionaL Neural Networks☆15Jul 6, 2023Updated 2 years ago
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- simple energy vad☆19Jun 3, 2017Updated 8 years ago
- [Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet)☆31Nov 22, 2022Updated 3 years ago
- assignments for e6870 ASR class☆42Apr 23, 2019Updated 7 years ago
- VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The…☆10Apr 17, 2020Updated 6 years ago
- Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a …☆256Mar 3, 2023Updated 3 years ago
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆17Oct 26, 2021Updated 4 years ago
- Acoustic feature extraction using Librosa library and openSMILE toolkit.使用Librosa音频处理库和openSMILE工具包,进行简单的声学特征提取☆217May 26, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- (Theano) Implementations about deep neural network, recurrent neural network, LSTM, and structured learining.☆10Nov 9, 2016Updated 9 years ago
- Classification of Urban sounds using several classification methods, namely SVM, MLP and CNN using MFCC features.☆13Apr 15, 2020Updated 6 years ago
- Lightweight and extensible DTN library☆19Mar 28, 2020Updated 6 years ago
- TensorFlow,DCGAN,VAE,LSTM,CNN,Acoustic Scene Classification☆11Jun 5, 2019Updated 6 years ago
- This repository allows to use kaldi to train an i-vector extractor and extract i-vectors through a python interface.☆11Nov 27, 2017Updated 8 years ago
- Detect Depression with AI Sub-challenge (DSS) of AVEC2019 experienment version via YZK☆15May 28, 2021Updated 4 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Sep 22, 2023Updated 2 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab.☆43Oct 28, 2016Updated 9 years ago
- Synthesis speech detection based on Breathing-Talking-Silence sounds☆21Sep 3, 2025Updated 8 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A pipeline from Dataset Gathering,Data annotations, Model training,Model Evaluation for viseme (visual sound phoneme) classification☆14Jan 19, 2021Updated 5 years ago
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the…☆11Jul 24, 2024Updated last year
- System for Emotion Detection in given speech data using joint modelling of hand crafted prosody rich features , MFCC features and LSTM ba…☆10Nov 15, 2017Updated 8 years ago
- some scripts for asvspoof2017☆11Dec 27, 2018Updated 7 years ago
- ☆13May 9, 2022Updated 4 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆15Jun 4, 2019Updated 6 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 7 years ago