ZhihaoDU/speech_feature_extractor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZhihaoDU/speech_feature_extractor)

ZhihaoDU / speech_feature_extractor

Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.

☆129

Alternatives and similar repositories for speech_feature_extractor

Users that are interested in speech_feature_extractor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bingo-todd / Gammatone-filters
View on GitHub
Python implementation of Gammatone filter
☆25Jun 7, 2022Updated 4 years ago
hyli666 / DNN-SpeechEnhancement
View on GitHub
☆55Jul 21, 2019Updated 7 years ago
mcusi / gammatonegram
View on GitHub
Python version of http://www.ee.columbia.edu/ln/rosa/matlab/gammatonegram/
☆15Oct 15, 2018Updated 7 years ago
aishoot / Speech_Feature_Extraction
View on GitHub
Feature extraction of speech signal is the initial stage of any speech recognition system.
☆97Sep 3, 2020Updated 5 years ago
jqi41 / Gfcc
View on GitHub
Gammatone feature for robust speech recognition
☆14Aug 1, 2016Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yongxuUSTC / sednn
View on GitHub
deep learning based speech enhancement using keras or pytorch, make it easy to use
☆339Feb 26, 2020Updated 6 years ago
ifnspaml / Perceptual-Weighting-Filter-Loss
View on GitHub
A perceptual weighting filter loss for DNN training in speech enhancement
☆24Apr 30, 2022Updated 4 years ago
zhr1201 / Multi-channel-speech-extraction-using-DNN
View on GitHub
A tensorflow implementation of my paper Combining beamforming and deep neural networks for multi-channel speech extraction
☆69Dec 15, 2020Updated 5 years ago
zhr1201 / CNN-for-single-channel-speech-enhancement
View on GitHub
Convolutional neural nets for single channel speech enhancement
☆144Dec 15, 2020Updated 5 years ago
BYRTIMO / END-TO-END-SPEECH-ENHANCEMENT-BASED-ON-DISCRETE-COSINE-TRANSFORM
View on GitHub
☆18Nov 10, 2019Updated 6 years ago
speechLabBcCuny / onssen
View on GitHub
An open-source speech separation and enhancement library
☆214May 13, 2020Updated 6 years ago
TowerYsable / speech_enhancement_awesome
View on GitHub
☆24Oct 27, 2021Updated 4 years ago
anicolson / DeepXi
View on GitHub
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
☆523Feb 17, 2022Updated 4 years ago
danielbraithwt / Speech-Enhancement-with-Variance-Constrained-Autoencoders
View on GitHub
Code and audio files associated with the paper "Speech Enhancement with Variance Constrained Autoencoders" presented at Interspeech 2019
☆15Oct 10, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
funcwj / deep-clustering
View on GitHub
deep clustering method for single-channel speech separation
☆110Jun 21, 2022Updated 4 years ago
supikiti / PNCC
View on GitHub
A implementation of Power Normalized Cepstral Coefficients: PNCC
☆54Aug 11, 2019Updated 6 years ago
haoxiangsnr / A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
View on GitHub
A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…
☆350Sep 5, 2020Updated 5 years ago
sp-uhh / mp-gtf
View on GitHub
Multi-Phase Gammatone Filterbank (MP-GTF) construction for Python
☆48Apr 30, 2020Updated 6 years ago
yunzqq / DeepMMSE
View on GitHub
DeepMMSE: A Deep Learning Approach to MMSE-based Noise Power Spectral Density Estimation
☆12Jun 4, 2020Updated 6 years ago
detly / gammatone
View on GitHub
Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.
☆229Jun 29, 2023Updated 3 years ago
yuzhou-git / deep-casa
View on GitHub
Tensorflow implementation of deep CASA
☆65Jun 6, 2021Updated 5 years ago
yongxuUSTC / DNN-for-speech-enhancement
View on GitHub
DNN-for-speech-enhancement
☆176Feb 23, 2023Updated 3 years ago
funcwj / uPIT-for-speech-separation
View on GitHub
Speech separation with utterance-level PIT experiments
☆106Jul 12, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
funcwj / conv-tasnet
View on GitHub
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/…
☆219Jul 6, 2023Updated 3 years ago
huyanxin / phasen
View on GitHub
A unofficial Pytorch implementation of Microsoft's PHASEN
☆235Apr 10, 2024Updated 2 years ago
luan78zaoha / kaldi-timit-sre-ivector
View on GitHub
Develop speaker recognition model based on i-vector using TIMIT database
☆16Jul 4, 2019Updated 7 years ago
AppleHolic / audioset_augmentor
View on GitHub
Sound augmentation using Large-scale audio dataset (Audioset)
☆45Jun 29, 2021Updated 5 years ago
fwkz / lpcc-speech-recognition
View on GitHub
Speech recognition using Linear Predictive Cepstral Coefficients and Dynamic Time Wrapping algorithm.
☆15Feb 19, 2014Updated 12 years ago
nycsv / Speech_Enhancement_DNN_NMF
View on GitHub
Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF
☆189Mar 29, 2019Updated 7 years ago
lifelongeek / AAS_enhancement
View on GitHub
This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…
☆28Oct 10, 2019Updated 6 years ago
mpariente / pystoi
View on GitHub
Python implementation of the Short Term Objective Intelligibility measure
☆359Dec 29, 2023Updated 2 years ago
alexdoberman / ma
View on GitHub
speech enhancement algorithms for microphone arrays
☆15May 12, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
TeaPoly / warp-ctc-crf
View on GitHub
An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.
☆12Jul 5, 2021Updated 5 years ago
aishoot / LSTM_PIT_Speech_Separation
View on GitHub
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
☆311Jan 6, 2022Updated 4 years ago
zhang201882 / MTF-CRNN
View on GitHub
Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…
☆23Apr 15, 2020Updated 6 years ago
fgnt / nn-gev
View on GitHub
Neural network supported GEV beamformer
☆216Feb 19, 2018Updated 8 years ago
naplab / DANet
View on GitHub
Deep Attractor Network (DANet) for single-channel speech separation
☆77Oct 1, 2018Updated 7 years ago
seanwood / gcc-nmf
View on GitHub
Real-time GCC-NMF Blind Speech Separation and Enhancement
☆327Apr 8, 2019Updated 7 years ago
vBaiCai / python-pesq
View on GitHub
A python package for calculating the PESQ.
☆410Jul 16, 2025Updated last year