aishoot/Speech_Feature_Extraction

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aishoot/Speech_Feature_Extraction)

aishoot / Speech_Feature_Extraction

Feature extraction of speech signal is the initial stage of any speech recognition system.

☆97

Alternatives and similar repositories for Speech_Feature_Extraction

Users that are interested in Speech_Feature_Extraction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZhihaoDU / speech_feature_extractor
View on GitHub
Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…
☆129Aug 12, 2020Updated 5 years ago
matthijsvk / TIMITspeech
View on GitHub
Speech recognition on the TIMIT (or any other) dataset
☆44Nov 2, 2017Updated 8 years ago
shincling / TDAAv2
View on GitHub
The updated version of TDAA model.
☆14Jul 2, 2020Updated 6 years ago
gionanide / Speech_Signal_Processing_and_Classification
View on GitHub
Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a …
☆257Mar 3, 2023Updated 3 years ago
acids-ircam / lottery_mir
View on GitHub
Ultra-light MIR models with a structured lottery ticket hypothesis approach
☆13Sep 21, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mostafaelaraby / Tensorflow-Keyword-Spotting
View on GitHub
Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
☆29Feb 12, 2018Updated 8 years ago
aishoot / LSTM_PIT_Speech_Separation
View on GitHub
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
☆311Jan 6, 2022Updated 4 years ago
ansleliu / ConvolutionaNeuralNetworksToEnhanceCodedSpeech
View on GitHub
In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…
☆28Mar 8, 2020Updated 6 years ago
pancak3 / HMM-Viterbi-CUDA
View on GitHub
Parallel and Multicore Computing Project 2
☆12Apr 16, 2020Updated 6 years ago
dpwe / calc_sbpca
View on GitHub
Subband PCA feature calculation
☆16Nov 5, 2018Updated 7 years ago
hyli666 / DNN-SpeechEnhancement
View on GitHub
☆55Jul 21, 2019Updated 7 years ago
qqueing / pytorch-G2P
View on GitHub
(semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean
☆23Dec 17, 2017Updated 8 years ago
espnet / warp-ctc
View on GitHub
Pytorch Bindings for warp-ctc maintained by ESPnet
☆17Feb 20, 2021Updated 5 years ago
xiaoxiaomiao323 / MSA
View on GitHub
☆16Feb 19, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
pengzhendong / speaker-diarization
View on GitHub
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆15Dec 23, 2024Updated last year
anicolson / matlab_feat
View on GitHub
Functions for creating speech features in MATLAB.
☆14Jul 7, 2020Updated 6 years ago
Cocoxili / VAD
View on GitHub
Voice Activity Detection
☆29Nov 13, 2017Updated 8 years ago
KWTsou1220 / mann-for-speech-separation
View on GitHub
Neural Turing machine for source separation in Tensorflow
☆18Aug 16, 2017Updated 8 years ago
janson9192 / autokws2021
View on GitHub
☆13Mar 25, 2021Updated 5 years ago
genzen2103 / Speaker-Recognition-System-using-GMM
View on GitHub
System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models
☆21Nov 5, 2017Updated 8 years ago
liyongze / lstm_speaker_verification
View on GitHub
☆35Apr 8, 2019Updated 7 years ago
athena-team / athena-transform
View on GitHub
☆21Jan 13, 2020Updated 6 years ago
keithyin / simple_speech_recog
View on GitHub
☆24Apr 13, 2018Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
chloelee1230 / nlp_ranking
View on GitHub
☆10May 22, 2023Updated 3 years ago
sandepp123 / Speech_Emotion_Recognition
View on GitHub
Classifies Emotion in Speech Signal
☆10May 30, 2016Updated 10 years ago
dictation-toolbox / voicecode
View on GitHub
VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The…
☆10Apr 17, 2020Updated 6 years ago
chunmeifeng / FedIns
View on GitHub
【ICCV 2023】Towards Instance-adaptive Inference for Federated Learning
☆12Mar 31, 2025Updated last year
drbinliang / Speech_Recognition
View on GitHub
A simple speech recognition using HMM (python)
☆61Apr 30, 2014Updated 12 years ago
tpeet / ML-KWS-for-MCU
View on GitHub
Keyword spotting on Arm Cortex-M Microcontrollers
☆14May 20, 2019Updated 7 years ago
ashwin9999 / speech-recognition-CNN
View on GitHub
A speech recognition system based on a Convolutional Neural Network built using TensorFlow
☆22Dec 6, 2020Updated 5 years ago
ZitengWang / python_kaldi_features
View on GitHub
python codes to extract MFCC and FBANK speech features for Kaldi
☆67Nov 28, 2018Updated 7 years ago
tmalsburg / PsychlingDatasets
View on GitHub
A list of publicly available data sets from psycholinguistic studies
☆31Oct 25, 2016Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
CynthiaSuwi / Wavenet-demo
View on GitHub
A TensorFlow implementation for Chinese speech recognition based on DeepMind's WaveNet
☆15Mar 27, 2018Updated 8 years ago
zooniverse / WhaleFM
View on GitHub
Whale FM archive, data
☆16Mar 4, 2015Updated 11 years ago
TokyoTechX-TAs / web-crawler
View on GitHub
Python-based cross-platform tool for mining text data (html, transcript, problems) of edX MOOCs on a user's dashboard. It is an extension…
☆10Feb 12, 2020Updated 6 years ago
nghiapq77 / voice-verification
View on GitHub
Zalo AI Challenge 2020 - Top 2 @ Voice Verification
☆15Oct 4, 2022Updated 3 years ago
funcwj / conv-tasnet
View on GitHub
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/…
☆219Jul 6, 2023Updated 3 years ago
usc-sail / mica-speech-activity-detection
View on GitHub
Robust Speech Activity Detection (SAD) in movie audio
☆26Jan 27, 2021Updated 5 years ago
kaituoxu / TasNet
View on GitHub
A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation…
☆125Jan 27, 2019Updated 7 years ago