Baidu-AIP / speech-vad-demoLinks

集成Webrtc的VAD，用于切分音频文件

☆341

Alternatives and similar repositories for speech-vad-demo

Users that are interested in speech-vad-demo are comparing it to the libraries listed below

Sorting:

cpuimage / WebRTC_NS
Noise Suppression Module Port From WebRTC
☆328Updated 4 years ago
lifeiteng / codingmath
Tech
☆83Updated 8 years ago
cpuimage / WebRTC_VAD
Voice Activity Detector Module Port From WebRTC
☆178Updated 5 years ago
cpuimage / rnnoise
Recurrent neural network for audio noise reduction
☆251Updated 4 years ago
shichaog / WebRTC-audio-processing
webrtc audio processing
☆398Updated 5 years ago
DoubangoTelecom / webrtc-audioproc
WebRTC AudioProc (AEC, VAD, NS...)
☆104Updated 4 years ago
dreamno23 / vad
从webrtc抽离出来的vad源代码，供语音分析/检测使用
☆30Updated 7 years ago
wangshub / python-vad
🔈 Use python to achieve voice activity detection, this little program may be helpful for voice application
☆167Updated 7 years ago
dpirch / libfvad
Voice activity detection (VAD) library, based on WebRTC's VAD engine
☆551Updated last year
cpuimage / WebRTC_AGC
Automatic Gain Control Module Port From WebRTC
☆175Updated 6 years ago
tbright17 / kaldi-dnn-ali-gop
Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.
☆232Updated 6 years ago
sailist / ASRFrame
An Automatic Speech Recognition Frame ，一个中文语音识别的完整框架，提供了多个模型
☆247Updated 4 years ago
baidubce / pie
百度云流式语音识别客户端 SDK
☆78Updated last month
shiweixingcn / vad
VAD(voice activity detection) implement and using for baidu voice recognition
☆63Updated 9 years ago
xiongyihui / python-webrtc-audio-processing
Python bindings of WebRTC Audio Processing
☆193Updated 2 months ago
cotinyang / MRCP-Plugin-Demo
A demo repository for UniMRCP plugin implementation with iflytek ASR & TTS API
☆132Updated 3 years ago
Deeperjia / tensorflow-wavenet
speech recognition based on tensorflow 1.0.0
☆140Updated 8 years ago
Z-yq / TensorflowASR
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目，CPU上的实时率(RTF)小于0.1
☆474Updated 4 months ago
open-speech / speech-aligner
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…
☆405Updated 5 years ago
Linzecong / MFCC-DTW
Simple MFCC extractor and an speech recognition algorithm (Dynamic Time Warping)
☆47Updated 7 years ago
aishell-foundation / DaCiDian
DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)
☆302Updated 5 years ago
jagger2048 / WebRtc_noise_suppression
This is WebRtc noise suppression module demo.
☆102Updated 5 years ago
Jackiexiao / MTTS
A Demo of Mandarin/Chinese TTS frontend
☆280Updated 3 years ago
XiaoMi / kaldi-onnx
Kaldi model converter to ONNX
☆244Updated 2 years ago
jimbozhang / kaldi-gop
Kaldi-based goodness of pronunciation (GOP)
☆151Updated 4 years ago
cpuimage / SimpleAudioDenoise
A Simple and Efficient Implementation Of Fast Fourier Transform For Audio Denoise
☆107Updated 4 years ago
halleytl / pyvad
VAD(Voice Activity Detector) python 实现对时时读入的流式数据进行端点检测
☆49Updated 10 years ago
didi / athena
A release version for https://github.com/athena-team/athena
☆127Updated 2 years ago
ewan-xu / AEC3
AEC3 Extracted From WebRTC
☆180Updated 3 years ago
chenkui164 / FastASR
这是一个用C++实现ASR推理的项目，它依赖很少，安装也很简单，推理速度很快，在树莓派4B等ARM平台也可以流畅的运行。支持的模型是由Google的Transformer模型中优化而来，数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小…
☆532Updated 2 years ago