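Speech feature extraction for machine learning, including FBank and MFCC, with explanations of the underlying principles and step-by-step implementations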
☆53May 17, 2019Updated 6 years ago
Alternatives and similar repositories for SpeechProcessForMachineLearning
Users that are interested in SpeechProcessForMachineLearning are comparing it to the libraries listed below.
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆67Nov 28, 2018Updated 7 years ago
- Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFC…☆18Jun 27, 2017Updated 8 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆54Mar 24, 2023Updated 3 years ago
- ☆22Jul 28, 2018Updated 7 years ago
- ☆10May 22, 2023Updated 2 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- Web app created to collect audios for course project☆10Apr 6, 2018Updated 8 years ago
- A summary of speech data augment algorithms☆69Jan 12, 2021Updated 5 years ago
- Neural architecture search(NAS)☆10Jan 21, 2019Updated 7 years ago
- VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The…☆10Apr 17, 2020Updated 6 years ago
- An acoustic model built with an end-to-end approach, using characters as the modeling unit and a DCNN-CTC network architecture.☆70Jan 26, 2019Updated 7 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆118Jun 22, 2022Updated 3 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- Generative Adversarial Networks for different impaired speech conversions☆39Jul 6, 2023Updated 2 years ago
- Removes silence segments from wav audio files☆29Feb 29, 2020Updated 6 years ago
- iSeparate library for the SDX2023 challenge☆15Dec 15, 2023Updated 2 years ago
- ☆16Sep 4, 2019Updated 6 years ago
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,422Oct 20, 2021Updated 4 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab.☆43Oct 28, 2016Updated 9 years ago
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆17Oct 26, 2021Updated 4 years ago
- Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)☆58Oct 25, 2019Updated 6 years ago
- ☆11Sep 26, 2022Updated 3 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- Speaker identification using voice MFCCs and GMM☆54Dec 13, 2020Updated 5 years ago
- Gammatone feature for robust speech recognition☆14Aug 1, 2016Updated 9 years ago
- Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE☆15Dec 3, 2022Updated 3 years ago
- Feature extraction for accented-speech or pathological speech☆18Apr 2, 2019Updated 7 years ago
- Mispronunciation detection code for jingju singing voice☆20Sep 5, 2018Updated 7 years ago
- Speech recognition of the digits 0-9 based on GMM and MFCC features. GMM, MFCC, speech recognition, Chinese data, sklearn, Digital Voice Recognition.☆19Jun 21, 2022Updated 3 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE☆13Mar 31, 2021Updated 5 years ago
- Counts frequencies of words using movie and television subtitles.☆20Jan 26, 2015Updated 11 years ago
- Speech recognition using Linear Predictive Cepstral Coefficients and Dynamic Time Warping algorithm.☆15Feb 19, 2014Updated 12 years ago
- ☆13Sep 23, 2025Updated 6 months ago
- A bunch of experiments using Bark and Mel scales, wavelets and paraconsistent feature engineering in order to find the best methods to cl…☆13Aug 16, 2023Updated 2 years ago
- A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor …☆15Dec 8, 2022Updated 3 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16May 8, 2022Updated 3 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆378Jul 21, 2022Updated 3 years ago