Magic-Bubble/SpeechProcessForMachineLearning

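Speech feature extraction for machine learning, covering FBank, MFCC, and more, with explanations of the underlying principles and step-by-step implementations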

☆54

Alternatives and similar repositories for SpeechProcessForMachineLearning

Users who are interested in SpeechProcessForMachineLearning are comparing it to the libraries listed below.

adam2go / mfcc
Calculate MFCC/Fbank feature for wav files
☆15 · Nov 21, 2017 · Updated 8 years ago
spaceraccoon / accent-trainer
Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFC…
☆18 · Jun 27, 2017 · Updated 9 years ago
AI-HPC-Research-Team / LIGO_noise_suppression
deep neural network based workflow for noise suppression and signal recovery of real-world LIGO observational data
☆16 · Mar 14, 2024 · Updated 2 years ago
ZitengWang / python_kaldi_features
python codes to extract MFCC and FBANK speech features for Kaldi
☆67 · Nov 28, 2018 · Updated 7 years ago
royswastik / intelligent-team-building-recommendation-system
☆22 · Jul 28, 2018 · Updated 8 years ago
a-n-rose / Build-CNN-or-LSTM-or-CNNLSTM-with-speech-features
A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…
☆55 · Mar 24, 2023 · Updated 3 years ago
chloelee1230 / nlp_ranking
☆10 · May 22, 2023 · Updated 3 years ago
bagustris / dimensional-ser
Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning
☆17 · Aug 2, 2024 · Updated last year
ichn-hu / DSP-Audio-Collector
Web app created to collect audios for course project
☆10 · Apr 6, 2018 · Updated 8 years ago
AdinAck / Voice2Voice
A Python neural network made with TensorFlow that converts one person's voice into another.
☆10 · Jan 16, 2021 · Updated 5 years ago
zzpDapeng / speech_data_augment
A summary of speech data augment algorithms
☆69 · Jan 12, 2021 · Updated 5 years ago
cam-mobsys / covid19-sounds-kdd20
☆15 · Nov 25, 2020 · Updated 5 years ago
amitchone / ASR
A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech …
☆16 · Apr 23, 2018 · Updated 8 years ago
zw76859420 / ASR_WORD
Builds an acoustic model with an end-to-end approach, using characters as the modeling unit and a DCNN-CTC network architecture.
☆71 · Jan 26, 2019 · Updated 7 years ago
tencent-ailab / 3m-asr
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
☆119 · Jun 22, 2022 · Updated 4 years ago
mauriciovander / silence-removal
Removes silence segments from wav audio files
☆30 · Feb 29, 2020 · Updated 6 years ago
FantSun / Speechflow
Speechflow for emotion recognition related information decomposition
☆10 · Jul 27, 2021 · Updated 5 years ago
jameslyons / matlab_speech_features
A set of speech feature extraction functions for ASR and speaker identification written in matlab.
☆43 · Oct 28, 2016 · Updated 9 years ago
desh2608 / css
PyTorch implementation of Continuous Speech Separation
☆12 · Oct 5, 2022 · Updated 3 years ago
naba89 / iSeparate-SDX
iSeparate library for the SDX2023 challenge
☆15 · Dec 15, 2023 · Updated 2 years ago
ronggong / mispronunciation-detection
Mispronunciation detection code for jingju singing voice
☆19 · Sep 5, 2018 · Updated 7 years ago
jameslyons / python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
☆2,423 · Oct 20, 2021 · Updated 4 years ago
ensismoebius / voiceSpoofingDetectionWavelet
A bunch of experiments using Bark and Mel scales, wavelets and paraconsistent feature engineering in order to find the best methods to cl…
☆12 · Aug 16, 2023 · Updated 2 years ago
toeybaa / place365
☆16 · Jul 13, 2016 · Updated 10 years ago
Hangz-nju-cuhk / Vision-Infused-Audio-Inpainter-VIAI
Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)
☆58 · Oct 25, 2019 · Updated 6 years ago
idiap / pddetection-reps-learning
Supervised Speech Representation Learning for Parkinson's Disease Classification
☆18 · Oct 26, 2021 · Updated 4 years ago
SuperKogito / Voice-based-speaker-identification
Speaker identification using voice MFCCs and GMM
☆56 · Dec 13, 2020 · Updated 5 years ago
jqi41 / Gfcc
Gammatone feature for robust speech recognition
☆14 · Aug 1, 2016 · Updated 9 years ago
dcleres / Parkinson_Disease_ML
A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor …
☆15 · Dec 8, 2022 · Updated 3 years ago
dwgnr / speech-conversion
Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE
☆15 · Dec 3, 2022 · Updated 3 years ago
FreedomIntelligence / MTalk-Bench
MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols
☆20 · Nov 19, 2025 · Updated 8 months ago
WordsAPI / wordfrequencies
Counts frequencies of words using movie and television subtitles.
☆20 · Jan 26, 2015 · Updated 11 years ago
tbright17 / accent-feat
Feature extraction for accented-speech or pathological speech
☆18 · Apr 2, 2019 · Updated 7 years ago
zhengyima / GMM_Digital_Voice_Recognition
Speech recognition of the digits 0-9 based on GMM and MFCC features. GMM, MFCC, speech recognition, Chinese data, sklearn, Digital Voice Recognition.
☆18 · Jun 21, 2022 · Updated 4 years ago
collectivat / cmusphinx-models
Acoustic and language models for minorised languages.
☆26 · Jul 17, 2026 · Updated last week
linan2 / TensorFlow-speech-enhancement
DNN and RCED speech enhancement
☆20 · Jan 30, 2024 · Updated 2 years ago
haotangxjtu / MSCL
code for Multisample-based Contrastive Loss for Top-k Recommendation (IEEE TMM)
☆10 · Nov 23, 2022 · Updated 3 years ago
chutaklee / CantoASR
Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)
☆16 · May 8, 2022 · Updated 4 years ago
ZhengkunTian / OpenTransformer
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
☆378 · Jul 21, 2022 · Updated 4 years ago