itaa / soja-box
A little useful toolbox for python.
☆78Updated 5 years ago
Alternatives and similar repositories for soja-box:
Users that are interested in soja-box are comparing it to the libraries listed below
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆70Updated 6 years ago
- speaker recognition using keras☆36Updated 2 years ago
- 基于dVector的说话人识别keras☆90Updated 4 years ago
- ☆143Updated 4 years ago
- 用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现☆52Updated 5 years ago
- Chinese keyword spotting model using LSTM RNN☆174Updated 6 years ago
- 利用webRTC对语音进行处理,实现VAD和降噪处理☆51Updated 6 years ago
- 未来杯语音赛道说话人识别的baseline☆48Updated 6 years ago
- Region proposal network based small-footprint keyword spotting (Pytorch)☆54Updated last year
- Matlab implementation of the paper Noise Spectrum Estimation in Adverse Environments: Improved Minima Controlled Recursive Averaging☆73Updated 7 years ago
- ☆106Updated 4 years ago
- 基于深度学习的语音增强、去混响☆91Updated last year
- ASR for Chinese Mandarin☆75Updated 6 years ago
- A summary of speech data augment algorithms☆68Updated 4 years ago
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…☆126Updated 4 years ago
- Tensorflow version of DFSMN☆49Updated 6 years ago
- py-webrtcvad wrapper for trimming speech clips☆48Updated 2 years ago
- Voice Activity Detection LSTM-RNN learning model☆50Updated 7 years ago
- Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow)☆39Updated 7 years ago
- ☆35Updated 6 years ago
- [INTERSPEECH 2019] Waiting Update! This project is a demonstration of the paper UNetGAN: A Robust Speech Enhancement Approach in Time Dom…☆20Updated 6 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆73Updated 5 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago
- about Speech enhancement☆33Updated 6 years ago
- A unofficial Pytorch implementation of Microsoft's PHASEN☆228Updated last year
- Convolutional neural nets for single channel speech enhancement☆141Updated 4 years ago
- The code for aishell-3 baseline acoustic model☆67Updated 4 years ago
- ☆69Updated 4 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆40Updated 5 years ago
- 语音处理,声源定位中的一些基本特征☆50Updated 7 years ago