用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现
☆53May 17, 2019Updated 6 years ago
Alternatives and similar repositories for SpeechProcessForMachineLearning
Users that are interested in SpeechProcessForMachineLearning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- deep neural network based workflow for noise suppression and signal recovery of real-world LIGO observational data☆16Mar 14, 2024Updated 2 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆67Nov 28, 2018Updated 7 years ago
- Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFC…☆18Jun 27, 2017Updated 8 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆54Mar 24, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆22Jul 28, 2018Updated 7 years ago
- ☆10May 22, 2023Updated 2 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- Web app created to collect audios for course project☆10Apr 6, 2018Updated 7 years ago
- ☆15Nov 25, 2020Updated 5 years ago
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆70Jan 26, 2019Updated 7 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆118Jun 22, 2022Updated 3 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Generative Adversarial Networks for different impaired speech conversions☆39Jul 6, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆17Nov 15, 2021Updated 4 years ago
- Removes silence segments from wav audio files☆29Feb 29, 2020Updated 6 years ago
- iSeparate library for the SDX2023 challenge☆15Dec 15, 2023Updated 2 years ago
- ☆16Sep 4, 2019Updated 6 years ago
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,423Oct 20, 2021Updated 4 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab.☆43Oct 28, 2016Updated 9 years ago
- Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization☆34Sep 3, 2020Updated 5 years ago
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆17Oct 26, 2021Updated 4 years ago
- Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)☆58Oct 25, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- ☆11Sep 26, 2022Updated 3 years ago
- Speaker identification using voice MFCCs and GMM☆54Dec 13, 2020Updated 5 years ago
- A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech …☆17Apr 23, 2018Updated 7 years ago
- Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE☆15Dec 3, 2022Updated 3 years ago
- Gammatone feature for robust speech recognition☆14Aug 1, 2016Updated 9 years ago
- Mispronunciation detection code for jingju singing voice☆20Sep 5, 2018Updated 7 years ago
- 基于GMM与MFCC特征进行数字0-9的语音识别,GMM,MFCC,语音识别,中文数据,sklearn,Digital Voice Recognition。☆19Jun 21, 2022Updated 3 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE☆13Mar 31, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16May 8, 2022Updated 3 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆379Jul 21, 2022Updated 3 years ago
- 《Python深度学习(第2版)》代码及笔记☆22Nov 24, 2022Updated 3 years ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆101Nov 12, 2021Updated 4 years ago
- Java Implementation of the Sonopy Audio Feature Extraction Library by MycroftAI☆16Feb 10, 2020Updated 6 years ago
- DNN and RCED speech enhancement☆20Jan 30, 2024Updated 2 years ago
- Illustrating EM for GMMs and HMMs☆12May 9, 2020Updated 5 years ago