Nishanksingla / Caffe-Speaker-Recognition
CNN to recognize speaker on a spoken numbers dataset
☆18Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for Caffe-Speaker-Recognition
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆47Updated 7 years ago
- Speaker recognition and verification with deep learning☆13Updated 7 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆20Updated 8 years ago
- Portal of Johannes and Felix's RNN implementation and further modifications for ASR☆21Updated 9 years ago
- TensorFlow Input Pipeline Examples based on multi-thread and FIFOQueue☆52Updated 7 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆86Updated 4 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆125Updated 7 years ago
- C\CPP implementation of Keyword Spotting, following the LSTM approach, based on Tensorflow☆9Updated 7 years ago
- Character level speech recognizer using ctc loss with deep rnns in TensorFlow.☆77Updated 6 years ago
- ☆71Updated 7 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆108Updated 5 years ago
- Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab☆45Updated 7 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16☆22Updated 2 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 7 years ago
- Faster Deep Neural Networks☆36Updated 7 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- A MXNet implementation of Baidu's DeepSpeech architecture☆83Updated 6 years ago
- Simple MXNet sequence-to-sequence model (neural machine translation)☆24Updated 6 years ago
- implement end-to-end asr algorithm with tensorflow☆40Updated 6 years ago
- Probabilistic Linear Discriminant Analysis☆14Updated 10 years ago
- ☆34Updated 7 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 6 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆31Updated 5 years ago
- pytorch implementation of lyre.ai's char2wav model☆32Updated 7 years ago
- blstm-cws : Bi-directional LSTM for Chinese Word Segmentation☆46Updated 7 years ago
- Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).☆130Updated 3 years ago
- A Tensorflow implementation of the Tex2Vis neural network, tha applies a new StochasticLoss criterion to learn a mapping from textual des…☆20Updated 8 years ago