zerozeus / audio-toy
基于CNN的音频识别
☆17Updated 6 years ago
Alternatives and similar repositories for audio-toy:
Users that are interested in audio-toy are comparing it to the libraries listed below
- Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别☆43Updated 6 years ago
- 语音降噪☆25Updated 7 years ago
- A CNN audio classifier via spectrogram images.☆10Updated 7 years ago
- ☆15Updated 5 years ago
- 以音素建模构建NN-CTC声学模型☆15Updated 5 years ago
- Google Speech Command Dataset Classification Neural Network, CNN, RNN☆24Updated 7 years ago
- 基于dVector的说话人识别keras☆90Updated 4 years ago
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow☆26Updated 6 years ago
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge☆32Updated 7 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D…☆25Updated 7 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 2 years ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model☆11Updated 7 years ago
- Using MFCC feature and DTW algorithm to recognize rumber 0-9☆18Updated 7 years ago
- SE-Resnet+AMSoftmax for Speaker Verification☆47Updated 6 years ago
- voice active detection (python ver/simple and easy-to-use)☆12Updated 8 years ago
- 城市声音分类 Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN☆95Updated 6 years ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆35Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- ☆15Updated 6 years ago
- A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech …☆17Updated 7 years ago
- 方言分类,pytorch☆43Updated 6 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 5 years ago
- 语音信号处理的基本知识☆36Updated 6 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆25Updated 9 months ago
- 基于卷积神经网络的语音识别声学模型的研究☆173Updated 5 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow)☆39Updated 7 years ago
- This Repository includes four different implementations of the Speaker Verification task including the GMM_UBM, Ivector, Deep-Speaker, an…☆32Updated 6 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Updated 6 years ago
- 基于多特征融合模型音乐情感分类器的实现☆23Updated 6 years ago