sshh12 / Conv-VAD
A packaged convolutional voice activity detector for noisy environments.
☆14Updated 5 years ago
Alternatives and similar repositories for Conv-VAD:
Users who are interested in Conv-VAD are comparing it to the libraries listed below.
- magicspeech competition recipe☆18Updated 4 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow)☆39Updated 6 years ago
- Various algorithms for voice activity detection☆22Updated 8 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 4 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- Curriculum Vitae of Quan Wang☆14Updated last month
- Detect emotion from audio☆13Updated 6 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆29Updated last year
- Example implementation of Monotonic Chunkwise Attention.☆51Updated 6 years ago
- ☆20Updated 5 years ago
- Keyword spotting by Kaldi library☆26Updated 8 years ago
- Minimize kaldi nnet3 chain decoder☆45Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec…☆43Updated last year
- wake word spotting with kaldi☆19Updated 4 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Updated 5 years ago
- ☆16Updated 5 years ago
- ☆31Updated 3 years ago
- ☆15Updated 5 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago
- Recurrent Neural Aligner☆49Updated 4 years ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆34Updated 6 years ago
- Implementations of CTC, Speech-Transformer, and CIF for end-to-end speech recognition with PyTorch☆22Updated 4 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Updated 7 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.☆36Updated 5 years ago
- Old language modeling tool that's used in kaldi☆16Updated last year
- An implementation of RNN-Transducer loss in TF-2.0.☆45Updated last year
- CTC+Beam_Search+kenlm is a decoding system for acoustic models that use Chinese characters as the modeling unit☆46Updated 6 years ago