huschen/kaggle_speech_recognition

Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.

☆72

Alternatives and similar repositories for kaggle_speech_recognition

Users that are interested in kaggle_speech_recognition are comparing it to the libraries listed below.

changjenyin / DNN_HMM_RNN_speech
"Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015
☆21 · Nov 25, 2016 · Updated 9 years ago
ShankHarinath / DeepSpeech2-Keras
DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflow
☆30 · Jan 16, 2018 · Updated 8 years ago
hirofumi0810 / tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)
☆314 · Jan 23, 2018 · Updated 8 years ago
mdangschat / ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
☆123 · Apr 15, 2020 · Updated 6 years ago
markusdr / transducersaurus
Automatically exported from code.google.com/p/transducersaurus
☆11 · Apr 1, 2015 · Updated 11 years ago
PengdaLiu / LAS-SpeechRecognition
Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).
☆32 · Jun 27, 2019 · Updated 7 years ago
vivianngo97 / Punctuation_Transcription
A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.
☆15 · Aug 6, 2020 · Updated 5 years ago
philipperemy / tensorflow-ctc-speech-recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
☆130 · Mar 4, 2021 · Updated 5 years ago
zw76859420 / ASR_WORD
Builds an acoustic model with an end-to-end approach, using characters as the modeling unit and a DCNN-CTC network architecture.
☆71 · Jan 26, 2019 · Updated 7 years ago
mobvoi / lstm_ctc
LSTM CTC End2End Speech Recognition.
☆38 · Apr 2, 2019 · Updated 7 years ago
desh2608 / dnn-hmm-asr
Hybrid DNN-HMM model for isolated digit recognition
☆32 · Dec 1, 2020 · Updated 5 years ago
WindQAQ / listen-attend-and-spell
Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API …
☆90 · Jan 31, 2019 · Updated 7 years ago
xiaozhouwang / tensorflow_speech_recognition_solution
Code for the 3rd-place solution in the Kaggle TensorFlow competition
☆98 · Apr 12, 2018 · Updated 8 years ago
robmsmt / KerasDeepSpeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
☆242 · Mar 17, 2018 · Updated 8 years ago
foamliu / Listen-Attend-Spell-v2
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
☆39 · Jul 25, 2019 · Updated 7 years ago
andi611 / Conditional-SpecGAN-Tensorflow
Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network
☆10 · Dec 12, 2018 · Updated 7 years ago
erogol / FFTNet
FFTNet vocoder implementation
☆81 · Sep 28, 2018 · Updated 7 years ago
mravanelli / pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…
☆40 · Feb 10, 2018 · Updated 8 years ago
bliunlpr / Robust_e2e_gan
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19 · Jul 19, 2019 · Updated 7 years ago
ShigekiKarita / espnet-semi-supervised
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…
☆38 · Feb 13, 2020 · Updated 6 years ago
cpuimage / Transformer-TTS
A Tensorflow Implementation like "Neural Speech Synthesis with Transformer Network" Port From OpenSeq2Seq
☆20 · Jul 6, 2023 · Updated 3 years ago
CSTR-Edinburgh / magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
☆80 · Oct 14, 2019 · Updated 6 years ago
jpuigcerver / kaldi-decoders
Custom decoders for Kaldi
☆81 · Jun 10, 2019 · Updated 7 years ago
Totoketchup / Adaptive-MultiSpeaker-Separation
Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem
☆50 · Jul 7, 2018 · Updated 8 years ago
placebokkk / pyfst
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17 · Apr 2, 2018 · Updated 8 years ago
chenzhehuai / kaldi-decoders
Custom decoders for Kaldi
☆13 · Jun 5, 2019 · Updated 7 years ago
rizar / attention-lvcsr
End-to-End Attention-Based Large Vocabulary Speech Recognition
☆265 · Nov 22, 2022 · Updated 3 years ago
skit-ai / kaldi-serve
Server framework for Kaldi ASR Toolkit
☆99 · Sep 17, 2023 · Updated 2 years ago
tbright17 / accent-feat
Feature extraction for accented-speech or pathological speech
☆18 · Apr 2, 2019 · Updated 7 years ago
minerva-ml / open-solution-cdiscount-starter
Open solution to the Cdiscount’s Image Classification Challenge
☆18 · Jun 22, 2022 · Updated 4 years ago
s-omranpour / Pytorch-Speech-Recognition
A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf
☆31 · Feb 10, 2022 · Updated 4 years ago
wangkenpu / rsrgan
Robust Speech Recognition Using Generative Adversarial Networks (GAN)
☆59 · Nov 25, 2019 · Updated 6 years ago
bjfu-ai-institute / speaker-recognition-papers
Share some recent speaker recognition papers and their implementations.
☆89 · Sep 26, 2019 · Updated 6 years ago
ljuvela / multiscale-GAN
Code for ICASSP 2019 paper
☆18 · Oct 29, 2018 · Updated 7 years ago
robin1001 / kaldi-aslp
☆43 · Jun 25, 2018 · Updated 8 years ago
lifeiteng / Optimizers
Tensorflow Optimizers
☆11 · Sep 1, 2019 · Updated 6 years ago
HawkAaron / RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
☆140 · Jun 7, 2021 · Updated 5 years ago
vishalshar / SpeakerDiarization_RNN_CNN_LSTM
Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…
☆64 · Jan 8, 2021 · Updated 5 years ago
datemoon / ASR-decoder
An ASR decoder and graph-construction project
☆33 · May 26, 2022 · Updated 4 years ago