Baidu-AIP / speech-vad-demo
集成Webrtc的VAD,用于切分音频文件
☆341Updated 4 years ago
Alternatives and similar repositories for speech-vad-demo:
Users that are interested in speech-vad-demo are comparing it to the libraries listed below
- Noise Suppression Module Port From WebRTC☆320Updated 4 years ago
- webrtc audio processing☆390Updated 4 years ago
- Tech☆82Updated 8 years ago
- Voice Activity Detector Module Port From WebRTC☆172Updated 4 years ago
- Recurrent neural network for audio noise reduction☆245Updated 4 years ago
- Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.☆229Updated 6 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆534Updated last year
- Automatic Gain Control Module Port From WebRTC☆173Updated 6 years ago
- A Demo of Mandarin/Chinese TTS frontend☆278Updated 2 years ago
- Python bindings of WebRTC Audio Processing☆188Updated 7 months ago
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆301Updated 4 years ago
- VAD(voice activity detection) implement and using for baidu voice recognition☆63Updated 9 years ago
- 一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1☆473Updated last month
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆399Updated 5 years ago
- Kaldi-based goodness of pronunciation (GOP)☆149Updated 4 years ago
- 🔈 Use python to achieve voice activity detection, this little program may be helpful for voice application☆168Updated 7 years ago
- WebRTC AudioProc (AEC, VAD, NS...)☆102Updated 4 years ago
- 从webrtc抽离出来的vad源代码,供语音分析/检测使用☆30Updated 7 years ago
- An Automatic Speech Recognition Frame ,一个中文语音识别的完整框架, 提供了多个模型☆246Updated 4 years ago
- A release version for https://github.com/athena-team/athena☆126Updated 2 years ago
- AEC3 Extracted From WebRTC☆174Updated 3 years ago
- 百度云流式语音识别客户端 SDK☆78Updated last week
- 这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小…☆508Updated 2 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆386Updated 10 months ago
- VAD(Voice Activity Detector) python 实现对时时读入的流式数据进行端点检测☆49Updated 10 years ago
- Kaldi model converter to ONNX☆241Updated 2 years ago
- ☆86Updated 4 months ago
- Mandarin ASR system based on tensorflow☆108Updated 6 years ago
- 超快的中文普通话TTS☆120Updated 4 years ago
- Acoustic Echo Canceller for Mobile Module Port From WebRTC☆190Updated last year