Magic-Bubble / SpeechProcessForMachineLearningLinks
用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现
☆52Updated 6 years ago
Alternatives and similar repositories for SpeechProcessForMachineLearning
Users that are interested in SpeechProcessForMachineLearning are comparing it to the libraries listed below
Sorting:
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆70Updated 6 years ago
- 基于dVector的说话人识别keras☆90Updated 4 years ago
- ☆143Updated 4 years ago
- Seq2Seq Speech Recognition with Transformer on Mandarin Chinese☆116Updated 5 years ago
- 未来杯语音赛道说话人识别的baseline☆48Updated 6 years ago
- 语音信号处理的基本知识☆36Updated 6 years ago
- ☆106Updated 4 years ago
- PyTorch re-implementation of Speech-Transformer☆101Updated 3 years ago
- 基于卷积神经网络的语音识别声学模型的研究☆174Updated 5 years ago
- transformer for ASR-systerm (via tensorflow2.0)☆114Updated 6 years ago
- A pytorch based end2end speech recognition system.☆115Updated 4 years ago
- End-to-end speech recognition on AISHELL dataset.☆32Updated 3 years ago
- 基于深度学习的语音增强、去混响☆94Updated last year
- Encoder and Decoder and Attention Based Prosody Prediction☆68Updated 7 years ago
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…☆128Updated 4 years ago
- 利用webRTC对语音进行处理,实现VAD和降噪处理☆51Updated 6 years ago
- PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …☆147Updated 5 years ago
- ☆55Updated 5 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- py-webrtcvad wrapper for trimming speech clips☆48Updated 3 years ago
- Region proposal network based small-footprint keyword spotting (Pytorch)☆55Updated last year
- A unofficial Pytorch implementation of Microsoft's PHASEN☆231Updated last year
- python codes to extract MFCC and FBANK speech features for Kaldi☆66Updated 6 years ago
- A statistical model-based Voice Activity Detection☆192Updated 6 years ago
- A summary of speech data augment algorithms☆69Updated 4 years ago
- 方言分类,pytorch☆43Updated 6 years ago
- 主要参考李宏毅老师2020年人类语言处理课程资料整理,包括代码和ppt☆34Updated 4 years ago
- ASR for Chinese Mandarin☆75Updated 7 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆376Updated 2 years ago
- Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)☆124Updated 2 years ago