zerozeus / audio-toy
CNN-based audio recognition
☆17Updated 5 years ago
Related projects
Alternatives and complementary repositories for audio-toy
- A CNN audio classifier via spectrogram images.☆10Updated 7 years ago
- Speech denoising☆23Updated 6 years ago
- Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN☆86Updated 5 years ago
- Fundamentals of speech signal processing☆35Updated 5 years ago
- Building an NN-CTC acoustic model with phoneme-level modeling☆15Updated 5 years ago
- Project report from the speech emotion recognition track of the two-week TAL (好未来) AI training camp, held July 30 to August 13, 2018☆32Updated 5 years ago
- Chinese speech recognition using CTC with Keras☆43Updated 6 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D…☆25Updated 6 years ago
- ☆16Updated 5 years ago
- SE-Resnet+AMSoftmax for Speaker Verification☆48Updated 6 years ago
- Various neural networks implemented with Keras + TensorFlow☆85Updated 5 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆66Updated 2 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Speech recognition with MFCC feature processing and a CNN☆96Updated 5 years ago
- Implementation of a music emotion classifier based on a multi-feature fusion model☆23Updated 6 years ago
- Research on CNN-based acoustic models for speech recognition☆171Updated 5 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆38Updated 6 years ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model☆11Updated 6 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Updated 9 years ago
- Speech feature extraction for machine learning, including FBank and MFCC, with explanations of the principles and step-by-step implementations☆50Updated 5 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- System for Emotion Detection in given speech data using joint modelling of hand-crafted prosody-rich features, MFCC features and LSTM ba…☆10Updated 7 years ago
- Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN…☆14Updated 4 years ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆33Updated 6 years ago
- Speech Recognition with DFCNN and Transformer☆19Updated last year
- Small-vocabulary Mandarin speech recognition based on Kaldi, trained with a DNN☆27Updated 5 years ago
- Baseline for speaker recognition in the speech track of the Future Cup (未来杯) competition☆48Updated 5 years ago
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge☆32Updated 6 years ago