用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现
☆54May 17, 2019Updated 7 years ago
Alternatives and similar repositories for SpeechProcessForMachineLearning
Users that are interested in SpeechProcessForMachineLearning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- python codes to extract MFCC and FBANK speech features for Kaldi☆67Nov 28, 2018Updated 7 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆55Mar 24, 2023Updated 3 years ago
- ☆22Jul 28, 2018Updated 7 years ago
- ☆10May 22, 2023Updated 3 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Web app created to collect audios for course project☆10Apr 6, 2018Updated 8 years ago
- A summary of speech data augment algorithms☆69Jan 12, 2021Updated 5 years ago
- Neural architecture search(NAS)☆10Jan 21, 2019Updated 7 years ago
- VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The…☆10Apr 17, 2020Updated 6 years ago
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆70Jan 26, 2019Updated 7 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆119Jun 22, 2022Updated 3 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Generative Adversarial Networks for different impaired speech conversions☆39Jul 6, 2023Updated 2 years ago
- Removes silence segments from wav audio files☆30Feb 29, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- iSeparate library for the SDX2023 challenge☆15Dec 15, 2023Updated 2 years ago
- ☆16Sep 4, 2019Updated 6 years ago
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆18Oct 26, 2021Updated 4 years ago
- ☆11Sep 26, 2022Updated 3 years ago
- Speaker identification using voice MFCCs and GMM☆55Dec 13, 2020Updated 5 years ago
- A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech …☆17Apr 23, 2018Updated 8 years ago
- Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE☆15Dec 3, 2022Updated 3 years ago
- Feature extraction for accented-speech or pathological speech☆18Apr 2, 2019Updated 7 years ago
- Mispronunciation detection code for jingju singing voice☆19Sep 5, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 基于GMM与MFCC特征进行数字0-9的语音识别,GMM,MFCC,语音识别,中文数据,sklearn,Digital Voice Recognition。☆18Jun 21, 2022Updated 3 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE☆13Mar 31, 2021Updated 5 years ago
- Speech recognition using Linear Predictive Cepstral Coefficients and Dynamic Time Wrapping algorithm.☆15Feb 19, 2014Updated 12 years ago
- ☆13Sep 23, 2025Updated 8 months ago
- A bunch of experiments using Bark and Mel scales, wavelets and paraconsistent feature engineering in order to find the best methods to cl…☆13Aug 16, 2023Updated 2 years ago
- code for Multisample-based Contrastive Loss for Top-k Recommendation (IEEE TMM)☆10Nov 23, 2022Updated 3 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆378Jul 21, 2022Updated 3 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16May 8, 2022Updated 4 years ago
- A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor …☆15Dec 8, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Audio or speech signal processing guide.☆57Jul 16, 2018Updated 7 years ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆102Nov 12, 2021Updated 4 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆140Jun 7, 2021Updated 5 years ago
- Java Implementation of the Sonopy Audio Feature Extraction Library by MycroftAI☆16Feb 10, 2020Updated 6 years ago
- DNN and RCED speech enhancement☆20Jan 30, 2024Updated 2 years ago
- Classify documents using Python based on SVM and TF-IDF.☆15Nov 19, 2019Updated 6 years ago
- Illustrating EM for GMMs and HMMs☆12May 9, 2020Updated 6 years ago