zerozeus / audio-toyLinks
基于CNN的音频识别
☆17Updated 6 years ago
Alternatives and similar repositories for audio-toy
Users that are interested in audio-toy are comparing it to the libraries listed below
Sorting:
- 城市声音分类 Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN☆94Updated 6 years ago
- 语音识别 MFCCs特征处理 cnn神经网络☆100Updated 6 years ago
- A CNN audio classifier via spectrogram images.☆10Updated 8 years ago
- keras+tensorflow实现的各种神经网络☆86Updated 6 years ago
- 2018年7⽉30⽇-8⽉13⽇持续2周的好未来AI训练营中语⾳情感识别营的项目报告☆33Updated 6 years ago
- 语音降噪☆25Updated 7 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D…☆25Updated 7 years ago
- SE-Resnet+AMSoftmax for Speaker Verification☆47Updated 6 years ago
- 2018年7⽉30⽇-8⽉13⽇持续2周的AI训练营中语⾳情感识别营的项目报告。☆95Updated 6 years ago
- Google Speech Command Dataset Classification Neural Network, CNN, RNN☆25Updated 8 years ago
- Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别☆43Updated 7 years ago
- 基于卷积神经网络的语音识别声学模型的研究☆176Updated 6 years ago
- 基于dVector的说话人识别keras☆90Updated 4 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆25Updated last year
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow☆26Updated 6 years ago
- Speaker recognition ,Voiceprint recognition☆53Updated 5 years ago
- 对音频文件的处理:音频信息,读取内容,获取时长,切割音频,pcm与wav互转☆38Updated 6 years ago
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge☆32Updated 7 years ago
- neural network and loss for asv implemented by PyTorch. (Triplet loss, LMCL, Angular Loss, Softmax)☆21Updated 5 years ago
- 这是一个基于全卷积神经网络的语音识别系统☆77Updated 6 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Updated 10 years ago
- speaker recognition using keras☆36Updated 2 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 2 years ago
- An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three ne…☆29Updated 6 years ago
- ☆15Updated 6 years ago
- A simple speech recognition using HMM (python)☆62Updated 11 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- voice active detection (python ver/simple and easy-to-use)☆12Updated 8 years ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model☆11Updated 7 years ago
- Mandarin ASR system based on tensorflow☆108Updated 7 years ago