wblgers / hmm_speech_recognition_demo
A demo for simple isolated Chinese speech word recognition using GMMHMM in Python
☆42 · Updated 7 years ago
Alternatives and similar repositories for hmm_speech_recognition_demo
Users that are interested in hmm_speech_recognition_demo are comparing it to the libraries listed below
- A python implementation of isolated word recognition using Hidden Markov Model ☆40 · Updated 8 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet) ☆74 · Updated 4 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING" ☆36 · Updated 6 years ago
- LogMMSE speech enhancement/noise reduction ☆90 · Updated 5 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tensorflow for speaker recognition. ☆36 · Updated 6 years ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust… ☆44 · Updated 5 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system. ☆96 · Updated 5 years ago
- Python implementation of simple GMM and HMM models for isolated digit recognition. ☆67 · Updated 4 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition ☆46 · Updated 5 years ago
- Attention-based model for keywords spotting ☆19 · Updated 4 years ago
- Tensorflow implementation of "Small-Footprint Keyword Spotting with Multi-Scale Temporal Convolution" (INTERSPEECH 2020) ☆32 · Updated 5 years ago
- ☆60 · Updated 5 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting ☆23 · Updated 3 years ago
- Wav2Keyword is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2. ☆109 · Updated 3 years ago
- Audio data augmentation examples ☆34 · Updated 7 years ago
- SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enha… ☆76 · Updated 11 months ago
- Speech recognition on the TIMIT (or any other) dataset ☆44 · Updated 8 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat… ☆64 · Updated 5 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆96 · Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆68 · Updated 4 years ago
- A pytorch implementation of xvector embedding ☆79 · Updated 5 years ago
- transformer for ASR-system (via tensorflow2.0) ☆114 · Updated 6 years ago
- DDAE speech enhancement on spectrogram domain using Keras ☆25 · Updated 8 years ago
- ☆35 · Updated 6 years ago
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR). ☆39 · Updated 6 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis ☆23 · Updated 4 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification" ☆103 · Updated 6 years ago
- RASTA-PLP and MFCC tool based on rasta-mat ☆33 · Updated 3 years ago
- A tensorflow implementation of my paper Combining beamforming and deep neural networks for multi-channel speech extraction ☆68 · Updated 5 years ago
- Speaker diarization with GMM-UBM and MAP Adaptation ☆31 · Updated 7 years ago