Calculate MFCC/Fbank features for wav files
☆15Nov 21, 2017Updated 8 years ago
Alternatives and similar repositories for mfcc
Users that are interested in mfcc are comparing it to the libraries listed below.
- Illustrating EM for GMMs and HMMs☆12May 9, 2020Updated 5 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi☆67Nov 28, 2018Updated 7 years ago
- Speech feature extraction for machine learning, including FBank and MFCC, with explanations of the principles and step-by-step implementations☆53May 17, 2019Updated 6 years ago
- ☆13May 9, 2022Updated 3 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆54Mar 24, 2023Updated 2 years ago
- Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE☆15Dec 3, 2022Updated 3 years ago
- Gammatone feature for robust speech recognition☆14Aug 1, 2016Updated 9 years ago
- Feature extraction for accented-speech or pathological speech☆18Apr 2, 2019Updated 6 years ago
- Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and …☆20Oct 6, 2022Updated 3 years ago
- ☆11Apr 18, 2021Updated 4 years ago
- Speech recognition of the digits 0-9 based on GMM and MFCC features. GMM, MFCC, speech recognition, Chinese data, sklearn, Digital Voice Recognition.☆19Jun 21, 2022Updated 3 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE☆13Mar 31, 2021Updated 4 years ago
- Speech recognition using Linear Predictive Cepstral Coefficients and the Dynamic Time Warping algorithm.☆15Feb 19, 2014Updated 12 years ago
- A bunch of experiments using Bark and Mel scales, wavelets and paraconsistent feature engineering in order to find the best methods to cl…☆13Aug 16, 2023Updated 2 years ago
- Official repo for the STRFNet system that appeared at INTERSPEECH 2020☆12Mar 6, 2021Updated 5 years ago
- ☆12May 30, 2019Updated 6 years ago
- ☆14Oct 12, 2023Updated 2 years ago
- Code implementations from the first edition of Liao Xingyu's book 深度学习入门之PyTorch (Introduction to Deep Learning with PyTorch)☆11Jul 9, 2018Updated 7 years ago
- Code release for Grad-CAM Guided Attention Module for Fine-grained Visual Classification (MLSP 2022)☆13Aug 25, 2021Updated 4 years ago
- Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019☆13Jan 6, 2020Updated 6 years ago
- ☆12Oct 8, 2020Updated 5 years ago
- ☆13Mar 2, 2023Updated 3 years ago
- ☆22Jul 16, 2025Updated 8 months ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 6 years ago
- create CMakeLists.txt for kaldi☆20Apr 30, 2020Updated 5 years ago
- Python scripts for KiCad☆11Mar 21, 2016Updated 10 years ago
- ☆28Oct 7, 2025Updated 5 months ago
- FCN-rLSTM for vehicle counting in city cameras (unofficial implementation)☆13Jul 6, 2023Updated 2 years ago
- ☆51May 16, 2021Updated 4 years ago
- Python package for noise suppression in audio based on DNN☆22Mar 24, 2023Updated 2 years ago
- Autoencoder (AE) based methods for anomalous sound detection (ASD)☆14Jan 10, 2023Updated 3 years ago
- Lightweight CNN for Robust Voice Activity Detection☆20Jun 30, 2023Updated 2 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 5 years ago
- A pytorch implementation of MFCC.☆33Jun 1, 2022Updated 3 years ago
- Scientific-Computing-with-Scala_Code☆16Jan 30, 2023Updated 3 years ago
- Recognition of Audio Captcha using SVM☆25Mar 29, 2019Updated 6 years ago
- GAM : Gradient Attention Module of Optimization for Point Clouds Analysis (AAAI2023)☆16Mar 19, 2023Updated 3 years ago
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow☆26Nov 23, 2018Updated 7 years ago
- The code of Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space☆15Jul 12, 2022Updated 3 years ago