Baidu-AIP / speech-vad-demo
Integrates WebRTC's VAD to split audio files
☆341Updated 4 years ago
Alternatives and similar repositories for speech-vad-demo
Users that are interested in speech-vad-demo are comparing it to the libraries listed below
- Noise Suppression Module Port From WebRTC☆328Updated 4 years ago
- Tech☆83Updated 8 years ago
- Voice Activity Detector Module Port From WebRTC☆178Updated 5 years ago
- Recurrent neural network for audio noise reduction☆251Updated 4 years ago
- webrtc audio processing☆398Updated 5 years ago
- WebRTC AudioProc (AEC, VAD, NS...)☆104Updated 4 years ago
- VAD source code extracted from WebRTC, for speech analysis/detection☆30Updated 7 years ago
- 🔈 Use python to achieve voice activity detection, this little program may be helpful for voice application☆167Updated 7 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆551Updated last year
- Automatic Gain Control Module Port From WebRTC☆175Updated 6 years ago
- Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.☆232Updated 6 years ago
- An Automatic Speech Recognition framework: a complete framework for Chinese speech recognition, providing multiple models☆247Updated 4 years ago
- Baidu Cloud streaming speech recognition client SDK☆78Updated last month
- VAD (voice activity detection) implementation, used for Baidu voice recognition☆63Updated 9 years ago
- Python bindings of WebRTC Audio Processing☆193Updated 2 months ago
- A demo repository for UniMRCP plugin implementation with iflytek ASR & TTS API☆132Updated 3 years ago
- speech recognition based on tensorflow 1.0.0☆140Updated 8 years ago
- A project dedicated to bringing CPU/on-device model performance close to that of GPU models, with a real-time factor (RTF) below 0.1 on CPU☆474Updated 4 months ago
- speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text…☆405Updated 5 years ago
- Simple MFCC extractor and an speech recognition algorithm (Dynamic Time Warping)☆47Updated 7 years ago
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆302Updated 5 years ago
- This is WebRtc noise suppression module demo.☆102Updated 5 years ago
- A Demo of Mandarin/Chinese TTS frontend☆280Updated 3 years ago
- Kaldi model converter to ONNX☆244Updated 2 years ago
- Kaldi-based goodness of pronunciation (GOP)☆151Updated 4 years ago
- A Simple and Efficient Implementation Of Fast Fourier Transform For Audio Denoise☆107Updated 4 years ago
- VAD (Voice Activity Detector) Python implementation that performs endpoint detection on streaming data read in real time☆49Updated 10 years ago
- A release version for https://github.com/athena-team/athena☆127Updated 2 years ago
- AEC3 Extracted From WebRTC☆180Updated 3 years ago
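- A C++ project for ASR inference. It has few dependencies, is easy to install, runs inference quickly, and runs smoothly on ARM platforms such as the Raspberry Pi 4B. The supported models are optimized from Google's Transformer model, and the dataset is the open-source wenetspeech (10000+ hours) or Alibaba's private dataset (60000+…☆532Updated 2 years ago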