这个工程的目的是从视频中获取语音识别的训练数据,用于训练字幕自动生成
☆53Aug 5, 2018Updated 7 years ago
Alternatives and similar repositories for from_video_get_ASR_traindata
Users that are interested in from_video_get_ASR_traindata are comparing it to the libraries listed below
Sorting:
- Mandarin ASR system based on tensorflow☆108Aug 20, 2018Updated 7 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- Minimize kaldi nnet3 chain decoder☆45Jan 10, 2020Updated 6 years ago
- Using Baidu ASR auto-generating subtitles for any video file. 使用百度短语音识别技术为视频或音频生成字幕。☆12Jan 23, 2022Updated 4 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- 利用文字信息生成文字动画视频☆16Apr 14, 2022Updated 3 years ago
- A TensorFlow implementation for Chinese speech recognition based on DeepMind's WaveNet☆15Mar 27, 2018Updated 7 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- The implementation of LSTM with projection layer by PyTorch☆17Sep 1, 2019Updated 6 years ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Mar 14, 2024Updated last year
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- An implementation of rnn transducer for sequence labeling problem☆22Feb 24, 2018Updated 8 years ago
- 语音识别 语音前端处理 语音合成 语音转换等等语音技术的资料汇总☆23Nov 8, 2019Updated 6 years ago
- Tensorflow version of DFSMN☆49Jul 17, 2018Updated 7 years ago
- Complex Neural Beamformer☆32Oct 15, 2020Updated 5 years ago
- 基于卷积神经网络的语音识别声学模型的研究☆181Jul 22, 2019Updated 6 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- A neural network consist of cnn and lstm for speech enhancement☆25Aug 2, 2018Updated 7 years ago
- CS224S / LINGUIST285 - Spoken Language Processing☆25Feb 13, 2020Updated 6 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Mar 1, 2017Updated 9 years ago
- An Automatic Speech Recognition Frame ,一个中文语音识别的完整框架, 提供了多个模型☆252Jan 6, 2021Updated 5 years ago
- Open Source WFST-based Decoder Toolkit☆77Feb 11, 2016Updated 10 years ago
- Some notes on Kaldi☆31Feb 20, 2015Updated 11 years ago
- ☆277Jan 15, 2021Updated 5 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆808Apr 6, 2023Updated 2 years ago
- In this project, we implemented a Subband Generalised Sidelobe Canceller(GSC) Beamforming algorithm for suppression of noise interfering …☆38Oct 8, 2017Updated 8 years ago
- NSNet2 Deep Noise Suppression (DNS) package☆39Sep 12, 2022Updated 3 years ago
- a kws demo on android☆40May 28, 2024Updated last year
- 双路视频拼接☆13Nov 13, 2022Updated 3 years ago
- ASR with PyTorch☆140Mar 10, 2019Updated 6 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Dec 8, 2019Updated 6 years ago
- CTC end -to-end ASR for timit and 863 corpus.☆219Dec 20, 2019Updated 6 years ago
- An audio classification system for learning with out-of-distribution data☆33Dec 8, 2022Updated 3 years ago
- DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflow☆30Jan 16, 2018Updated 8 years ago
- ☆35Apr 8, 2019Updated 6 years ago
- 一个简单的音频降噪工具,提高web UI界面和api接口☆44Nov 21, 2024Updated last year