lezasantaizi / from_video_get_ASR_traindataView external linksLinks
这个工程的目的是从视频中获取语音识别的训练数据,用于训练字幕自动生成
☆53Aug 5, 2018Updated 7 years ago
Alternatives and similar repositories for from_video_get_ASR_traindata
Users that are interested in from_video_get_ASR_traindata are comparing it to the libraries listed below
Sorting:
- Mandarin ASR system based on tensorflow☆108Aug 20, 2018Updated 7 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- Minimize kaldi nnet3 chain decoder☆45Jan 10, 2020Updated 6 years ago
- Using Baidu ASR auto-generating subtitles for any video file. 使用百度短语音识别技术为视频或音频生成字幕。☆12Jan 23, 2022Updated 4 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- [ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation☆17Aug 12, 2022Updated 3 years ago
- A TensorFlow implementation for Chinese speech recognition based on DeepMind's WaveNet☆15Mar 27, 2018Updated 7 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆110Jan 24, 2019Updated 7 years ago
- The implementation of LSTM with projection layer by PyTorch☆17Sep 1, 2019Updated 6 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- An implementation of rnn transducer for sequence labeling problem☆22Feb 24, 2018Updated 7 years ago
- 语音识别 语音前端处理 语音合成 语音转换等等语音技术的资料汇总☆22Nov 8, 2019Updated 6 years ago
- Tensorflow version of DFSMN☆49Jul 17, 2018Updated 7 years ago
- Complex Neural Beamformer☆32Oct 15, 2020Updated 5 years ago
- 基于卷积神经网络的语音识别声学模型的研究☆180Jul 22, 2019Updated 6 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Mar 1, 2017Updated 8 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- CS224S / LINGUIST285 - Spoken Language Processing☆25Feb 13, 2020Updated 6 years ago
- An Automatic Speech Recognition Frame ,一个中文语音识别的完整框架, 提供了多个模型☆252Jan 6, 2021Updated 5 years ago
- Open Source WFST-based Decoder Toolkit☆77Feb 11, 2016Updated 10 years ago
- Python wrappers for Kaldi data☆33Sep 27, 2017Updated 8 years ago
- Some notes on Kaldi☆31Feb 20, 2015Updated 10 years ago
- ☆276Jan 15, 2021Updated 5 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆809Apr 6, 2023Updated 2 years ago
- In this project, we implemented a Subband Generalised Sidelobe Canceller(GSC) Beamforming algorithm for suppression of noise interfering …☆38Oct 8, 2017Updated 8 years ago
- a kws demo on android☆40May 28, 2024Updated last year
- NSNet2 Deep Noise Suppression (DNS) package☆39Sep 12, 2022Updated 3 years ago
- ASR with PyTorch☆140Mar 10, 2019Updated 6 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Dec 8, 2019Updated 6 years ago
- CTC end -to-end ASR for timit and 863 corpus.☆219Dec 20, 2019Updated 6 years ago
- DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflow☆30Jan 16, 2018Updated 8 years ago
- Tools for ASR Corpus Generation from Online Video☆140Feb 10, 2019Updated 7 years ago
- Chinese Text Normalization and Dataset☆90May 14, 2022Updated 3 years ago
- Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.☆235Apr 3, 2019Updated 6 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 6 years ago