Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can…
☆256Mar 3, 2023Updated 3 years ago
Alternatives and similar repositories for Speech_Signal_Processing_and_Classification
Users that are interested in Speech_Signal_Processing_and_Classification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Neural Machine Translation using LSTMs and Attention mechanism. Two approaches were implemented, models, one without out attention using …☆12Jun 21, 2022Updated 3 years ago
- Tools for speech processing, keyword spotting☆17Mar 11, 2020Updated 6 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆97Sep 3, 2020Updated 5 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Feb 1, 2019Updated 7 years ago
- RASTA-PLP and MFCC tool based rasta-mat☆33Jul 6, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…☆129Aug 12, 2020Updated 5 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆97May 30, 2020Updated 5 years ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Feb 6, 2025Updated last year
- speech enhancement using DNN: [1] Xu, Y., Du, J., Dai, L.R. and Lee, C.H., 2015. A regression approach to speech enhancement based on dee…☆14Sep 17, 2019Updated 6 years ago
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,422Oct 20, 2021Updated 4 years ago
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.☆310Jan 6, 2022Updated 4 years ago
- deep learning based speech enhancement using keras or pytorch, make it easy to use☆339Feb 26, 2020Updated 6 years ago
- System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models☆21Nov 5, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Digital Signal Processing Library☆15May 4, 2017Updated 8 years ago
- A PyTorch implementation of Conv-TasNet☆46Nov 25, 2019Updated 6 years ago
- ☆20May 13, 2019Updated 6 years ago
- feature extraction from speech signals☆395Jun 15, 2025Updated 10 months ago
- Audio feature extraction and classification☆227Jul 6, 2023Updated 2 years ago
- ☆38Jul 20, 2020Updated 5 years ago
- 3gpp协议26073里面的vad的移植☆14Feb 14, 2019Updated 7 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Jan 8, 2021Updated 5 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15Jul 15, 2019Updated 6 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Mar 8, 2020Updated 6 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆54Mar 24, 2023Updated 3 years ago
- Crypto projects in python, e.g. Attacks to Vigenere, RSA, Telnet Protocol, Hip Replacement , Vernam cipher, Crack Zip Files, Encryptions…☆20Mar 6, 2023Updated 3 years ago
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆346Sep 5, 2020Updated 5 years ago
- ☆35Apr 8, 2019Updated 7 years ago
- Tensorflow training scripts for depthwise separable convolutional neural networks for keyword spotting, and C++ code for deployment.☆41Apr 2, 2020Updated 6 years ago
- ☆11Nov 17, 2017Updated 8 years ago
- A temporal module for PyTorch-ComplexTensor☆44Jun 28, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Mar 18, 2019Updated 7 years ago
- (tensorflow) Wiener Filter based Speech Enhancement(LSTM/BLSTM, GRU/BGRU, Transformer)☆15Dec 3, 2019Updated 6 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D…☆25Jan 19, 2018Updated 8 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Jul 17, 2020Updated 5 years ago
- Tools for Speech Enhancement integrated with Kaldi☆430Jul 6, 2023Updated 2 years ago
- A Python toolbox for speech features extraction☆165Feb 8, 2023Updated 3 years ago