yh1008/speech-to-text

yh1008 / speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

☆71

Alternatives and similar repositories for speech-to-text

Users that are interested in speech-to-text are comparing it to the libraries listed below.

JRMeyer / multi-task-kaldi
An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…
☆55 · Jan 2, 2020 · Updated 6 years ago
irebai / SpecAugment_KALDI
A KALDI/C++ implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆15 · Sep 4, 2019 · Updated 6 years ago
kate-egorova / ASR-hybrid-decoding
This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…
☆11 · Feb 4, 2020 · Updated 6 years ago
datemoon / ASR-decoder
An ASR decoder and make-graph project
☆33 · May 26, 2022 · Updated 4 years ago
hirofumi0810 / asr_preprocessing
Python implementation of pre-processing for End-to-End speech recognition
☆70 · Feb 19, 2018 · Updated 8 years ago
JRMeyer / easy-kaldi
Use your data to create a speech recognition system in Kaldi. Fast.
☆65 · Jan 2, 2020 · Updated 6 years ago
someonefighting / tf-kaldi-speaker-master
☆17 · Jun 30, 2020 · Updated 6 years ago
kaituoxu / kaldi-ktnet1
Kaldi extended by Kaituo XU with new features in nnet1.
☆12 · Dec 16, 2018 · Updated 7 years ago
navana-tech / baseline_recipe_is21s_indic_asr_challenge
Multilingual and code-switching ASR challenges for low-resource Indian languages.
☆23 · Jul 26, 2021 · Updated 4 years ago
dspavankumar / keras-kaldi
Keras Interface for Kaldi ASR
☆122 · Sep 27, 2017 · Updated 8 years ago
miras-tech / MirasVoice
MirasVoice is a data set consisting of speech samples from bilinguals to train a neural network for optimization of speaker verification algor…
☆19 · Mar 15, 2020 · Updated 6 years ago
igormq / asr-study
Implementation of all-neural speech recognition systems using Keras and TensorFlow
☆146 · Oct 12, 2017 · Updated 8 years ago
vrenkens / nabu
Code for end-to-end ASR with neural networks, built with TensorFlow
☆110 · Jan 24, 2019 · Updated 7 years ago
aishell-foundation / DaCiDian
DaCiDian is an open-source Chinese Mandarin lexicon for automatic speech recognition (ASR)
☆301 · Jun 15, 2020 · Updated 6 years ago
mdangschat / ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
☆123 · Apr 15, 2020 · Updated 6 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode well and quickly.
☆16 · Jun 17, 2022 · Updated 4 years ago
chenzhehuai / kaldi-decoders
Custom decoders for Kaldi
☆13 · Jun 5, 2019 · Updated 7 years ago
idiap / pkwrap
A PyTorch wrapper for LF-MMI training and parallel training in Kaldi
☆73 · Jun 8, 2022 · Updated 4 years ago
JarbasAl / kaldi_spotter
Wake word spotting with Kaldi
☆19 · Dec 3, 2020 · Updated 5 years ago
robin1001 / kws_on_android
A KWS demo on Android
☆40 · May 28, 2024 · Updated 2 years ago
asappresearch / multistream-cnn
Multistream CNN for Robust Acoustic Modeling
☆40 · Jun 17, 2021 · Updated 5 years ago
ffxiong / uaspeech
Baseline Kaldi script for UA-SPEECH corpus
☆32 · Oct 16, 2024 · Updated last year
qiujiali / lattice-rescore
☆16 · Jun 13, 2022 · Updated 4 years ago
moisesveleta / GOP-LSTM
Improving the Goodness of Pronunciation with DNNs and RNNs
☆32 · Sep 26, 2018 · Updated 7 years ago
pilot7747 / VoxDIY
This repository provides data and code for the "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16 · Jul 22, 2021 · Updated 4 years ago
zldzmfoq12 / VCtube
A package for crawling audio from YouTube
☆42 · Aug 8, 2023 · Updated 2 years ago
pengzhendong / welm
One command to build TLG.fst for WeNet.
☆30 · Oct 11, 2022 · Updated 3 years ago
hirofumi0810 / tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)
☆314 · Jan 23, 2018 · Updated 8 years ago
revdotcom / words2num
Convert words to numbers
☆21 · Apr 13, 2022 · Updated 4 years ago
oxinabox / Kaldi-Notes
Some notes on Kaldi
☆32 · Feb 20, 2015 · Updated 11 years ago
tbright17 / kaldi-dnn-ali-gop
Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.
☆236 · Apr 3, 2019 · Updated 7 years ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for the GramVaani Hindi ASR challenge
☆16 · Mar 26, 2022 · Updated 4 years ago
jpuigcerver / kaldi-decoders
Custom decoders for Kaldi
☆80 · Jun 10, 2019 · Updated 7 years ago
YiwenShaoStephen / pychain
PyTorch implementation of LF-MMI for End-to-end ASR
☆221 · Jan 14, 2021 · Updated 5 years ago
funcwj / setk
Tools for Speech Enhancement integrated with Kaldi
☆431 · Jul 6, 2023 · Updated 3 years ago
motazsaad / ara-pronunciation-tool
A Python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15 · Sep 5, 2017 · Updated 8 years ago
srvk / eesen
The official repository of the Eesen project
☆834 · May 23, 2019 · Updated 7 years ago
joaoantoniocn / AM-MobileNet1D
The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…
☆31 · Oct 3, 2023 · Updated 2 years ago
findnitai / TDNN-layer
A Keras layer implementation of Peddinti's paper "A time delay neural network architecture for efficient modeling of long temporal conte…
☆13 · Nov 19, 2018 · Updated 7 years ago