jingyonghou / RPN_KWS
Region proposal network based small-footprint keyword spotting (PyTorch)
☆55Updated last year
Alternatives and similar repositories for RPN_KWS
Users who are interested in RPN_KWS are comparing it to the repositories listed below.
- Mining effective negative training samples for keyword spotting (PyTorch)☆62Updated 5 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi☆66Updated 6 years ago
- about Speech enhancement☆33Updated 7 years ago
- ☆55Updated 5 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago
- ☆99Updated 7 years ago
- ☆129Updated 4 years ago
- ☆50Updated 4 years ago
- LSTM CTC End2End Speech Recognition.☆38Updated 6 years ago
- ☆35Updated 6 years ago
- Builds an acoustic model with an end-to-end approach, using characters as the modeling unit and a DCNN-CTC network architecture.☆70Updated 6 years ago
- Keyword Spotting for detecting a word in an audio file☆17Updated 6 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Updated 6 months ago
- Minimize kaldi nnet3 chain decoder☆45Updated 5 years ago
- DCASE2020 Challenge Task 1 baseline system☆25Updated 5 years ago
- ☆54Updated 6 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆74Updated 5 years ago
- PyTorch implementation of a Time Delay Neural Network (TDNN)☆41Updated 6 years ago
- Code for DCASE 2020 task 1a and task 1b.☆87Updated 3 years ago
- py-webrtcvad wrapper for trimming speech clips☆48Updated 3 years ago
- Baseline of DCASE 2019 task 4☆59Updated 2 years ago
- Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"☆32Updated 6 years ago
- Universal Deep neural network based speech enhancement demo and tools, well pre-trained DNN model☆66Updated 2 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 5 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆49Updated 6 years ago
- Deep learning based Speech Beamforming☆63Updated 7 years ago
- Code for adding reverberation to audio☆26Updated 2 years ago
- PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …☆148Updated 5 years ago
- An LDA/PLDA estimator using KALDI in python for speaker verification tasks☆100Updated 8 years ago
- ☆41Updated 7 years ago