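The goal of this project is to extract speech-recognition training data from videos, for use in training automatic subtitle generation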
☆53Aug 5, 2018Updated 7 years ago
Alternatives and similar repositories for from_video_get_ASR_traindata
Users that are interested in from_video_get_ASR_traindata are comparing it to the libraries listed below.
- Mandarin ASR system based on tensorflow☆108Aug 20, 2018Updated 7 years ago
- Uses Baidu ASR to auto-generate subtitles for any video file. Uses Baidu's short-speech recognition technology to generate subtitles for video or audio.☆12Jan 23, 2022Updated 4 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- Minimize kaldi nnet3 chain decoder☆45Jan 10, 2020Updated 6 years ago
- Generates text-animation videos from text information☆17Apr 14, 2022Updated 4 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆110Jan 24, 2019Updated 7 years ago
- CS224S / LINGUIST285 - Spoken Language Processing☆24Feb 13, 2020Updated 6 years ago
- A TensorFlow implementation for Chinese speech recognition based on DeepMind's WaveNet☆15Mar 27, 2018Updated 8 years ago
- Research on acoustic models for speech recognition based on convolutional neural networks☆181Jul 22, 2019Updated 6 years ago
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- speex aec kalman filter☆15Mar 17, 2024Updated 2 years ago
- ☆13May 15, 2025Updated 11 months ago
- WeChat server side☆10Jun 17, 2022Updated 3 years ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Mar 14, 2024Updated 2 years ago
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Sep 27, 2019Updated 6 years ago
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- implementing beamforming algorithm in C++☆11Jan 9, 2020Updated 6 years ago
- Tools for ASR Corpus Generation from Online Video☆138Feb 10, 2019Updated 7 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆37Dec 8, 2019Updated 6 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- The implementation of LSTM with projection layer by PyTorch☆17Sep 1, 2019Updated 6 years ago
- Tensorflow version of DFSMN☆49Jul 17, 2018Updated 7 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Mar 1, 2017Updated 9 years ago
- ☆14Jun 19, 2019Updated 6 years ago
- Chinese Text Normalization and Dataset☆91May 14, 2022Updated 3 years ago
- An Automatic Speech Recognition Frame, a complete framework for Chinese speech recognition that provides multiple models☆252Jan 6, 2021Updated 5 years ago
- [ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation☆17Aug 12, 2022Updated 3 years ago
- ☆12May 12, 2016Updated 9 years ago
- ☆11Aug 13, 2019Updated 6 years ago
- tensorflow speech synthesis c++ inference for voicenet☆16Mar 29, 2019Updated 7 years ago
- ☆277Jan 15, 2021Updated 5 years ago
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago
- https://github.com/ARM-software/ML-KWS-for-MCU☆14Jul 8, 2018Updated 7 years ago
- Next word prediction based on N-gram language model☆12Jan 11, 2015Updated 11 years ago
- ☆16Feb 7, 2014Updated 12 years ago
- This file is an implementation of the algorithm proposed in paper 'Phase-Based Dual-Microphone Robust Speech Enhancement'.☆18Aug 22, 2018Updated 7 years ago
- Entity extraction built on BERT and pointer networks☆14Aug 2, 2020Updated 5 years ago