lezasantaizi/from_video_get_ASR_traindata

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lezasantaizi/from_video_get_ASR_traindata)

lezasantaizi / from_video_get_ASR_traindata

这个工程的目的是从视频中获取语音识别的训练数据，用于训练字幕自动生成

☆53

Alternatives and similar repositories for from_video_get_ASR_traindata

Users that are interested in from_video_get_ASR_traindata are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Pelhans / ZASR_tensorflow
View on GitHub
Mandarin ASR system based on tensorflow
☆108Aug 20, 2018Updated 7 years ago
igormq / ctcdecode-pytorch
View on GitHub
Python implementation of CTC beam search decoder + agnostic LM scorer
☆20Dec 16, 2020Updated 5 years ago
xiangxyq / minimize-chain-decoder
View on GitHub
Minimize kaldi nnet3 chain decoder
☆45Jan 10, 2020Updated 6 years ago
vrenkens / nabu
View on GitHub
Code for end-to-end ASR with neural networks, build with TensorFlow
☆110Jan 24, 2019Updated 7 years ago
sknadig / cs224s
View on GitHub
CS224S / LINGUIST285 - Spoken Language Processing
☆24Feb 13, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CynthiaSuwi / Wavenet-demo
View on GitHub
A TensorFlow implementation for Chinese speech recognition based on DeepMind's WaveNet
☆15Mar 27, 2018Updated 8 years ago
zw76859420 / ASR_Syllable
View on GitHub
基于卷积神经网络的语音识别声学模型的研究
☆181Jul 22, 2019Updated 7 years ago
npujcong / Chinese_PSP
View on GitHub
Chinese Prosodic Structure Prediction
☆10May 18, 2019Updated 7 years ago
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
farhadmohsin / Robust-Adaptive-Beamforming
View on GitHub
☆14Jun 19, 2019Updated 7 years ago
datemoon / tf-code-acoustics
View on GitHub
it's a train acoustics model code lib
☆27May 20, 2020Updated 6 years ago
xcmyz / Lifelong-Learning-Tacotron2
View on GitHub
MultiSpeaker Tacotron2 using LifeLong Learning.
☆13Sep 27, 2019Updated 6 years ago
5yearsKim / beamforming
View on GitHub
implementing beamforming algorithm in C++
☆11Jan 9, 2020Updated 6 years ago
google-research-datasets / LLAMA1-Test-Set
View on GitHub
We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…
☆23Mar 14, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yc9701 / pansori
View on GitHub
Tools for ASR Corpus Generation from Online Video
☆140Feb 10, 2019Updated 7 years ago
danFromTelAviv / key_words_spotting
View on GitHub
implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"
☆38Dec 8, 2019Updated 6 years ago
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
Ajyy / inverse_chinese_text_normalization
View on GitHub
将normalize过的中文文本，做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。
☆13Apr 7, 2021Updated 5 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
rrbluke / CNBF
View on GitHub
Complex Neural Beamformer
☆33Oct 15, 2020Updated 5 years ago
topel / goodness-of-pronunciation-HTK
View on GitHub
Phone-level evaluation of L2 speakers (GOP algorithm)
☆27Mar 1, 2017Updated 9 years ago
robin1001 / kws_on_android
View on GitHub
a kws demo on android
☆40May 28, 2024Updated 2 years ago
yangxueruivs / DFSMN
View on GitHub
Tensorflow version of DFSMN
☆49Jul 17, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sailist / ASRFrame
View on GitHub
An Automatic Speech Recognition Frame ，一个中文语音识别的完整框架，提供了多个模型
☆252Jan 6, 2021Updated 5 years ago
thuhcsi / FlatTN
View on GitHub
Chinese Text Normalization and Dataset
☆91May 14, 2022Updated 4 years ago
255BITS / vocal-autoencoder
View on GitHub
☆12May 12, 2016Updated 10 years ago
Marcovaldong / lstmp.pytorch
View on GitHub
The implementation of LSTM with projection layer by PyTorch
☆17Sep 1, 2019Updated 6 years ago
npuichigo / ttsflow
View on GitHub
tensorflow speech synthesis c++ inference for voicenet
☆16Mar 29, 2019Updated 7 years ago
cywang97 / StreamingTransformer
View on GitHub
☆277Jan 15, 2021Updated 5 years ago
ahmetaa / kaldi-jni
View on GitHub
Experiment with JNI access to some Kaldi functions.
☆12Dec 31, 2018Updated 7 years ago
cdyangbo / end2endASR
View on GitHub
implement end-to-end asr algorithm with tensorflow
☆40Aug 23, 2018Updated 7 years ago
zYeoman / ML-KWS-for-FPGA
View on GitHub
https://github.com/ARM-software/ML-KWS-for-MCU
☆16Jul 8, 2018Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
colaudiolab / AudioSet-R
View on GitHub
Official implementation: "AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation"
☆19Oct 9, 2025Updated 9 months ago
ododoyo / EHNet
View on GitHub
A neural network consist of cnn and lstm for speech enhancement
☆25Aug 2, 2018Updated 7 years ago
XiaoxiangGao / Dual_mic_phase_based_speech_enhancement
View on GitHub
This file is an implementation of the algorithm proposed in paper 'Phase-Based Dual-Microphone Robust Speech Enhancement'.
☆18Aug 22, 2018Updated 7 years ago
robmike / mvdr_beamformer
View on GitHub
☆16Feb 7, 2014Updated 12 years ago
mispchallenge / misp2021_baseline
View on GitHub
☆29Jun 15, 2022Updated 4 years ago
sequence-labeling / rnn-transducer
View on GitHub
An implementation of rnn transducer for sequence labeling problem
☆22Feb 24, 2018Updated 8 years ago
kaituoxu / Speech-Transformer
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810Apr 6, 2023Updated 3 years ago