lucky-bai / kaggle-speech-recognition
TensorFlow Speech Recognition Challenge (Top 15%)
☆14Updated 7 years ago
Alternatives and similar repositories for kaggle-speech-recognition:
Users who are interested in kaggle-speech-recognition are comparing it to the repositories listed below
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challeng…☆58Updated 6 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆47Updated 8 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆86Updated 4 years ago
- An end-to-end ASR algorithm implemented in TensorFlow☆40Updated 6 years ago
- Kaggle Freesound Audio Tagging 2019 Competition Solution☆28Updated 5 years ago
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"☆11Updated 6 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 8 years ago
- Masked Conditional Neural Networks☆15Updated last year
- An implementation of Tacotron and Tacotron2☆81Updated 3 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Updated 6 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 5 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Updated 5 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆20Updated 8 years ago
- WaveNet Vocoder Samples☆23Updated 5 years ago
- 8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)☆113Updated 4 years ago
- DCASE2016 TASK1 Scene Classification☆12Updated 7 years ago
- Speaker embedding (verification and recognition) using TensorFlow with Kaldi☆41Updated 7 years ago
- Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)☆53Updated 5 years ago
- Audio data augmentation examples☆34Updated 6 years ago
- The implementation of 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 7 years ago
- Keyword spotting using various architectures, such as a convolutional VGGNet, a 1D convolutional network, and CTC.☆29Updated 7 years ago
- PyTorch implementation of lyre.ai's char2wav model☆32Updated 7 years ago