Magic-Bubble / SpeechProcessForMachineLearning
Speech feature extraction for machine learning, including FBank and MFCC, with explanations of the principles and step-by-step implementations
☆52 Updated 5 years ago
Alternatives and similar repositories for SpeechProcessForMachineLearning:
Users who are interested in SpeechProcessForMachineLearning are comparing it to the repositories listed below
- ☆142 Updated 4 years ago
- Builds an acoustic model with an end-to-end approach, using characters as the modeling unit and a DCNN-CTC network architecture. ☆70 Updated 6 years ago
- dVector-based speaker recognition in Keras ☆88 Updated 4 years ago
- Basics of speech signal processing ☆36 Updated 6 years ago
- ☆106 Updated 4 years ago
- End-to-end speech recognition on AISHELL dataset. ☆31 Updated 3 years ago
- Encoder and Decoder and Attention Based Prosody Prediction ☆69 Updated 7 years ago
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit… ☆126 Updated 4 years ago
- A librosa STFT/Fbank/MFCC feature extraction written in PyTorch using 1D convolutions. ☆75 Updated 2 years ago
- [ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech… ☆55 Updated 4 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi ☆65 Updated 6 years ago
- ASR for Chinese Mandarin ☆75 Updated 6 years ago
- Research on speech recognition acoustic models based on convolutional neural networks ☆173 Updated 5 years ago
- Use CTC to do Chinese speech recognition with Keras ☆43 Updated 6 years ago
- Papers of ASR, Tools of ASR ☆38 Updated 2 months ago
- Seq2Seq Speech Recognition with Transformer on Mandarin Chinese ☆116 Updated 5 years ago
- Listen, Attend and Spell - PyTorch Implementation ☆17 Updated 6 years ago
- Data preparation for separation ☆76 Updated 3 years ago
- Speech processing with WebRTC, implementing VAD and noise reduction ☆50 Updated 6 years ago
- Speaker recognition using Keras ☆36 Updated 2 years ago
- Baseline for speaker recognition in the speech track of the Future Cup (未来杯) ☆48 Updated 6 years ago
- Automatic Speech Recognition with TensorFlow (CNN+BLSTM+CTC) ☆12 Updated 6 years ago
- py-webrtcvad wrapper for trimming speech clips ☆48 Updated 2 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro… ☆118 Updated 2 years ago
- NN-CTC acoustic model built with phonemes as the modeling unit ☆15 Updated 5 years ago
- Deep-learning-based speech enhancement and dereverberation ☆91 Updated last year
- PyTorch re-implementation of Speech-Transformer ☆100 Updated 3 years ago
- SpEx+(tied) source code ☆82 Updated last year
- ☆69 Updated 4 years ago
- Homework assignments completed during the first session of the Shenlan Academy (深蓝学院) course "Speech Recognition: From Beginner to Mastery", shared for reference. ☆21 Updated 4 years ago