A Chinese speech recognition with autosub and deepspeech 在autosub上结合百度的deepspeech2模型实现中文语音识别
☆47Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for Autosub-with-Baidu-DeepSpeech2
Users that are interested in Autosub-with-Baidu-DeepSpeech2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- transformer的 encoder-decoder结构基于tensorflow实现的中文语音识别项目☆34Feb 24, 2021Updated 5 years ago
- 深蓝学院语音课程《语音识别从入门到精通》课程作业☆22Apr 2, 2020Updated 5 years ago
- cross-platform modular neural network inference library, small and efficient☆13May 15, 2023Updated 2 years ago
- Generates VCV/連続音/"triphone" recording lists and OTOs for UTAU☆16May 15, 2017Updated 8 years ago
- 使用BiLSTM对人民日报语料进行分词☆57Mar 3, 2019Updated 7 years ago
- 基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。☆759Dec 17, 2025Updated 3 months ago
- An Automatic Speech Recognition Frame ,一个中文语音识别的完整框架, 提供了多个模型☆252Jan 6, 2021Updated 5 years ago
- 封装了百度、捷通华声和讯飞语音识别的库,以及捷通华声、民族语文翻译、小牛翻译的封装。☆15Sep 10, 2019Updated 6 years ago
- Generate random material design background image.☆24Apr 6, 2017Updated 8 years ago
- The Android version of SlamAR.☆24Sep 8, 2017Updated 8 years ago
- A Pytorch implementation of triplet loss on VoxCeleb1☆12Oct 16, 2019Updated 6 years ago
- Code showing how to use a model based on the ML model base class.☆10Sep 30, 2022Updated 3 years ago
- use baidu voice-api to add subtitle to a vedio☆15Mar 17, 2019Updated 7 years ago
- stock Market Predicted for Kaggle-Sigma☆14Mar 26, 2019Updated 6 years ago
- Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别☆44Jul 10, 2018Updated 7 years ago
- Deep voice 3 + WORLD vocoder.☆17Jan 7, 2020Updated 6 years ago
- 代码迁移到 https://github.com/yutiansut/quantaxis☆25Jun 5, 2018Updated 7 years ago
- 自动填写问卷星表单☆10Mar 18, 2019Updated 7 years ago
- TTS前,文本标准化,将数字字母处理转化为汉字☆12Apr 27, 2024Updated last year
- 首届中文NL2SQL挑战赛复赛方案,评估数据集acc:0.85 复赛线上成绩: 0.833 Top15☆68Nov 29, 2020Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- A paper summary of Backdoor Attack against Neural Network☆13Aug 9, 2019Updated 6 years ago
- A toolbox for manipulating UTAU voicebank's mark file format(oto.ini)☆31May 2, 2022Updated 3 years ago
- 工业级中文语音识别系统电子书☆13Oct 30, 2020Updated 5 years ago
- DxOpenNI.dll for MikuMiku Dance, OpenNI 2 and NiTE 2 version☆16Jun 16, 2017Updated 8 years ago
- A framework using unity to achieve augmented reality based on OpenCV.☆14Sep 1, 2016Updated 9 years ago
- A Java wrapper for Raylib☆13Oct 22, 2018Updated 7 years ago
- jtalkDLL: OpenJTalk DLL☆20Mar 29, 2020Updated 5 years ago
- Code showing how to add metadata and versioning to an ML model base class.☆11Dec 8, 2022Updated 3 years ago
- 基于深度学习的中文分词尝试☆84Aug 27, 2015Updated 10 years ago
- Facial emotion recognition using TensorFlow☆20Apr 11, 2016Updated 9 years ago
- Dense visual SLAM for RGB-D cameras☆12Jun 27, 2016Updated 9 years ago
- 联通研究院-面向电信行业存量用户的智能套餐个性化匹配模型_复赛第6名☆50Dec 7, 2018Updated 7 years ago
- codes from the book “推荐系统开发实战”☆11Mar 19, 2020Updated 6 years ago
- 中文语音识别; Mandarin Automatic Speech Recognition;☆1,964Jul 25, 2024Updated last year
- Resampler for UTSU that works on Mac, Windows, and Linux☆41Jun 8, 2024Updated last year
- Code to reproduce "imagenet in 18 minutes" experiments☆19Mar 25, 2019Updated 6 years ago
- ☆19Jul 2, 2022Updated 3 years ago
- 端到端中文语音识别☆94Jul 25, 2024Updated last year