douglas125 / SpeechCmdRecognition
A neural attention model for speech command recognition
☆185 Updated 2 years ago
Alternatives and similar repositories for SpeechCmdRecognition:
Users that are interested in SpeechCmdRecognition are comparing it to the libraries listed below
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch ☆211 Updated 4 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet) ☆73 Updated 3 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1 ☆111 Updated 5 years ago
- Problem Agnostic Speech Encoder ☆440 Updated last year
- A statistical model-based Voice Activity Detection ☆192 Updated 6 years ago
- Speech Denoising with Deep Feature Losses ☆185 Updated 4 years ago
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc… ☆320 Updated 4 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data ☆78 Updated 7 years ago
- A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schiz… ☆55 Updated 6 years ago
- Includes some core functions and models for handling speech separation ☆155 Updated 3 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system. ☆92 Updated 4 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It reports state-of-the-art results on the Speech Commands dataset V1 and V2. ☆106 Updated 2 years ago
- A lightweight neural speaker embedding extractor based on Kaldi and PyTorch. ☆136 Updated 5 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities ☆207 Updated 3 years ago
- Deep neural network based speech enhancement toolkit ☆215 Updated 5 years ago
- ☆155 Updated 4 years ago
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method. ☆309 Updated 3 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec… ☆38 Updated 7 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat… ☆65 Updated 4 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente… ☆218 Updated 2 years ago
- End-to-End Speech Recognition Using Tensorflow ☆41 Updated 2 years ago
- HTK features in Python ☆74 Updated 6 years ago
- Deep Neural Network for Speaker Count Estimation ☆150 Updated 4 years ago
- Text Independent Speaker Verification Using GE2E Loss ☆84 Updated 6 years ago
- Voice Activity Detection (VAD) using deep learning. ☆196 Updated 5 years ago
- Time delay neural network (TDNN) implementation in Pytorch using unfold method ☆202 Updated 5 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification) ☆99 Updated 5 years ago
- Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection w… ☆191 Updated 2 years ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow) ☆224 Updated 4 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196 ☆314 Updated 4 years ago