786440445 / ASR_DFCNN_Transformer
1. ctc的DCNN声学模型+语言模型和 transformer的端到端模型
☆8Updated 2 years ago
Alternatives and similar repositories for ASR_DFCNN_Transformer:
Users that are interested in ASR_DFCNN_Transformer are comparing it to the libraries listed below
- 基于卷积神经网络的语音识别声学模型的研究☆172Updated 5 years ago
- Automatic Speech Recognition with TensorFlow(CNN+BLSTM+CTC)☆12Updated 6 years ago
- 用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现☆52Updated 5 years ago
- ☆142Updated 4 years ago
- Audio Split 基于双门限法的语音端点检测及语音分割☆132Updated 4 years ago
- 基于深度学习的语音增强、去混响☆89Updated last year
- Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别☆43Updated 6 years ago
- ☆106Updated 3 years ago
- Data preparation for separation☆76Updated 3 years ago
- ASR中文语音识别☆33Updated 5 years ago
- 利用Python+TensorFlow实现语音识别☆47Updated 6 years ago
- 说话人特征(声纹)提取工具,基于VGG-SR预训练模型。☆33Updated 4 years ago
- 基于dVector的说话人识别keras☆87Updated 4 years ago
- Speech Recognition with DFCNN and Transformer☆18Updated 2 years ago
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆71Updated 6 years ago
- Encoder and Decoder and Attention Based Prosody Prediction☆68Updated 7 years ago
- End-to-end speech recognition on AISHELL dataset.☆31Updated 3 years ago
- ☆15Updated 2 years ago
- 基于HMM与MFCC特征进行数字0-9的语音识别,HMM,GMMHMM,MFCC,语音识别,sklearn,Digital Voice Recognition。☆16Updated 2 years ago
- Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)☆122Updated last year
- ☆50Updated 4 years ago
- Seq2Seq Speech Recognition with Transformer on Mandarin Chinese☆116Updated 5 years ago
- 基于gan的语音增强☆15Updated 6 years ago
- 这是一个基于全卷积神经网络的语音识别系统☆77Updated 5 years ago
- tacotron-2(pytorch) + melgan(pytorch) chinese TTS☆26Updated last year
- ☆11Updated 2 years ago
- PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"☆19Updated 5 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆375Updated 2 years ago
- Papers of ASR, Tools of ASR☆39Updated last week
- Acoustic feature extraction using Librosa library and openSMILE toolkit.使用Librosa音频处理库和openSMILE工具包,进行简单的声学特征提取☆192Updated 4 years ago