这个工程的目的是从视频中获取语音识别的训练数据,用于训练字幕自动生成
☆53Aug 5, 2018Updated 7 years ago
Alternatives and similar repositories for from_video_get_ASR_traindata
Users that are interested in from_video_get_ASR_traindata are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mandarin ASR system based on tensorflow☆108Aug 20, 2018Updated 7 years ago
- Using Baidu ASR auto-generating subtitles for any video file. 使用百度短语音识别技术为视频或音频生成字幕。☆12Jan 23, 2022Updated 4 years ago
- Minimize kaldi nnet3 chain decoder☆45Jan 10, 2020Updated 6 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆110Jan 24, 2019Updated 7 years ago
- Complex Neural Beamformer☆33Oct 15, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- CS224S / LINGUIST285 - Spoken Language Processing☆24Feb 13, 2020Updated 6 years ago
- A TensorFlow implementation for Chinese speech recognition based on DeepMind's WaveNet☆15Mar 27, 2018Updated 8 years ago
- 基于卷积神经网络的语音识别声学模型的研究☆180Jul 22, 2019Updated 6 years ago
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 7 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- wechat服务端☆10Jun 17, 2022Updated 4 years ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Mar 14, 2024Updated 2 years ago
- ☆13May 15, 2025Updated last year
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Sep 27, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- it's a train acoustics model code lib☆27May 20, 2020Updated 6 years ago
- implementing beamforming algorithm in C++☆11Jan 9, 2020Updated 6 years ago
- Tools for ASR Corpus Generation from Online Video☆139Feb 10, 2019Updated 7 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆38Dec 8, 2019Updated 6 years ago
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Apr 7, 2021Updated 5 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 4 years ago
- The implementation of LSTM with projection layer by PyTorch☆17Sep 1, 2019Updated 6 years ago
- Tensorflow version of DFSMN☆49Jul 17, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- a kws demo on android☆40May 28, 2024Updated 2 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Mar 1, 2017Updated 9 years ago
- An Automatic Speech Recognition Frame ,一个中文语音识别的完整框架, 提供了多个模型☆251Jan 6, 2021Updated 5 years ago
- [ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation☆17Aug 12, 2022Updated 3 years ago
- ☆12May 12, 2016Updated 10 years ago
- tensorflow speech synthesis c++ inference for voicenet☆16Mar 29, 2019Updated 7 years ago
- ☆11Aug 13, 2019Updated 6 years ago
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago
- https://github.com/ARM-software/ML-KWS-for-MCU☆14Jul 8, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Next word prediction based on N-gram language model☆12Jan 11, 2015Updated 11 years ago
- implement end-to-end asr algorithm with tensorflow☆40Aug 23, 2018Updated 7 years ago
- This file is an implementation of the algorithm proposed in paper 'Phase-Based Dual-Microphone Robust Speech Enhancement'.☆18Aug 22, 2018Updated 7 years ago
- ☆16Feb 7, 2014Updated 12 years ago
- Weekly-review of BUCT Lab-614☆29Nov 25, 2017Updated 8 years ago
- 基于BERT和指针网络构建实体抽取任务☆14Aug 2, 2020Updated 5 years ago
- An implementation of rnn transducer for sequence labeling problem☆22Feb 24, 2018Updated 8 years ago