Baidu-AIP / speech-demoView external linksLinks
语音api示例
☆710Jul 25, 2024Updated last year
Alternatives and similar repositories for speech-demo
Users that are interested in speech-demo are comparing it to the libraries listed below
Sorting:
- 实时语音识别API WebSocket☆156Jul 16, 2024Updated last year
- 集成Webrtc的VAD,用于切分音频文件☆343Aug 26, 2020Updated 5 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- Linux C++ demo☆38May 21, 2024Updated last year
- 百度AI平台RESTful API SDK调用的示例☆29Sep 3, 2019Updated 6 years ago
- 百度云流式语音识别客户端 SDK☆80Nov 13, 2025Updated 3 months ago
- 内容审核及速率限制服务☆26May 18, 2025Updated 8 months ago
- 🎙 online demo use text and lesson audio in unity☆12Feb 28, 2018Updated 7 years ago
- Vim Speech Recognition Experiments☆20May 30, 2025Updated 8 months ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago
- media player for awtk☆11Feb 8, 2026Updated last week
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- Chinese text normalization. 中文文本规范化。☆60May 3, 2021Updated 4 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- ☆13Sep 28, 2018Updated 7 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Sep 4, 2019Updated 6 years ago
- A repository for Chinese text normalization.☆20May 2, 2021Updated 4 years ago
- A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统☆8,346Sep 6, 2025Updated 5 months ago
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆301Jun 15, 2020Updated 5 years ago
- a kws demo on android☆40May 28, 2024Updated last year
- Text frontend for ESPnet tts recipes☆34Jun 1, 2021Updated 4 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)☆339Jun 3, 2019Updated 6 years ago
- Coqui Inference Engine☆40Aug 3, 2021Updated 4 years ago
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…☆12,530Jan 27, 2026Updated 2 weeks ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Dec 19, 2018Updated 7 years ago
- 📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …☆597May 15, 2024Updated last year
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆379Jul 21, 2022Updated 3 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- Dart plugin wrapping the Sherpa-ONNX runtime. Contains example for speech recognition with Flutter☆22Jan 3, 2025Updated last year
- 这是一个用于连接小智AI服务的Python客户端库。它提供了简单的接口来进行语音对话和文本交互。☆26Mar 14, 2025Updated 11 months ago
- A simple TTS(text-to-speech) engine for Chinese mandarin☆21Feb 20, 2012Updated 13 years ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆21Jul 26, 2024Updated last year
- Python wrapper for kaldi's arpa2fst☆37Aug 27, 2025Updated 5 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Camera streaming on Android using ffmpeg, x264, live555, forked from https://github.com/parizene/android-streamer ,but some function re…☆11Aug 26, 2018Updated 7 years ago
- A CRF-based ASR Toolkit☆362Feb 5, 2026Updated last week