zhuzizyf/damo-fsmn-vad-infer-httpserver

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhuzizyf/damo-fsmn-vad-infer-httpserver)

zhuzizyf / damo-fsmn-vad-infer-httpserver

达摩fsmn vad c++推理服务

☆17

Alternatives and similar repositories for damo-fsmn-vad-infer-httpserver

Users that are interested in damo-fsmn-vad-infer-httpserver are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lovemefan / Silero-vad-pytorch
View on GitHub
silero-vad pytorch implement
☆38Nov 23, 2024Updated last year
wxqwinner / silero-vad-ncnn
View on GitHub
Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.
☆26Aug 21, 2024Updated last year
mmmgalleria / Dual-Microphone-Noise-Reduction-by-PLD-Technique
View on GitHub
Working on a dual-microphone noise reduction for mobile phone in noisy environment by Power Level Different Technique (PLD).
☆17Jul 25, 2020Updated 6 years ago
dengcunqin / noise-reduction
View on GitHub
noise reduction
☆17Jul 3, 2024Updated 2 years ago
shkim816 / temporal_dynamic_cnn
View on GitHub
TDY-CNN for text-independent speaker verification
☆19Nov 7, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
wenet-e2e / wenet_in_action_homework
View on GitHub
WeNet 实战课程作业
☆21Oct 7, 2022Updated 3 years ago
Hunterhuan / sphereface2_speaker_verification
View on GitHub
Exploring Binary Classification Loss for Speaker Verification
☆18Jul 18, 2023Updated 3 years ago
robin1001 / nn-vad
View on GitHub
simple dnn based vad
☆69Dec 2, 2018Updated 7 years ago
William1617 / gtcrn_c
View on GitHub
☆24Jul 17, 2024Updated 2 years ago
FeiGeChuanShu / FunASR-demo-ncnn
View on GitHub
some ncnn demos of FunASR
☆28Sep 23, 2024Updated last year
yucongzh / online_speaker_diarization
View on GitHub
☆15Jul 11, 2022Updated 4 years ago
daanzu / wenet_stt_python
View on GitHub
☆33Nov 27, 2021Updated 4 years ago
lovemefan / fsmn-vad
View on GitHub
A enterprise-grade Voice Activity Detector from modelscope and funasr.
☆139Apr 26, 2023Updated 3 years ago
pengzhendong / ngram-punctuator
View on GitHub
An N-gram punctuator for Chinese and English.
☆18Oct 14, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ctwgL / webrtc_agc2
View on GitHub
demo for webrtc agc2
☆36Dec 25, 2021Updated 4 years ago
adamsolomou / Speech-Enhancement
View on GitHub
Real-time speech enhancement based on spectral subtraction
☆16Feb 18, 2018Updated 8 years ago
twardoch / audiostretchy
View on GitHub
AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 file…
☆60Jul 5, 2026Updated 3 weeks ago
cmdIrelia / music2dDemo
View on GitHub
MUSIC DOA estimation
☆14Feb 14, 2019Updated 7 years ago
AXERA-TECH / ONNX-YOLO-World-Open-Vocabulary-Object-Detection
View on GitHub
Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU
☆12Aug 11, 2025Updated 11 months ago
jzi040941 / webrtc_rnnvad
View on GitHub
webrtc_rnnvad
☆24Jul 12, 2021Updated 5 years ago
kivenyangming / OpencvSocket
View on GitHub
这是一个使用opencv读取视频并使用socket进行传输视频画面的脚本文件，相较于调用ffmpeg传输节约了90%的数据量
☆11May 14, 2024Updated 2 years ago
XiaoxiangGao / Dual_mic_phase_based_speech_enhancement
View on GitHub
This file is an implementation of the algorithm proposed in paper 'Phase-Based Dual-Microphone Robust Speech Enhancement'.
☆18Aug 22, 2018Updated 7 years ago
xingchensong / TouchNet
View on GitHub
A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.
☆233Jul 2, 2026Updated 3 weeks ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Katock-Cricket / SAAI.Plugin
View on GitHub
GTA San Andreas with AI（ASI前端），将大语言模型(GPT)、TTS、SOVITS整合进入圣安地列斯，使用真正的AI控制NPC的行为、语音。
☆18Jul 27, 2025Updated 11 months ago
iduta / RealTime_dense_descriptors
View on GitHub
The sorce code for the realtime video descriptors: HOG, HOF, MBH and HMG
☆10Feb 6, 2017Updated 9 years ago
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year
arezamoosavi / deepface-app
View on GitHub
Full stack data-science project
☆12Jan 13, 2022Updated 4 years ago
pengzhendong / asr-decoder
View on GitHub
CTC decoder with hotwords for ASR.
☆38Jun 15, 2026Updated last month
chenllliang / CTDNN
View on GitHub
MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition
☆11Dec 4, 2021Updated 4 years ago
leixing518 / webrtc_aec_x86
View on GitHub
单独移植编译webrtc的aec模块
☆22Aug 30, 2018Updated 7 years ago
Huiyicc / gpt_sovits_cpp
View on GitHub
GPT-Sovits的c++实现版本
☆22Jan 9, 2026Updated 6 months ago
xiaomi-research / tts-prism
View on GitHub
☆47Apr 27, 2026Updated 2 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
QEDan / links_clustering
View on GitHub
Implementation of the Links Online Clustering algorithm: https://arxiv.org/abs/1801.10123
☆30May 13, 2026Updated 2 months ago
mohsenmbcom / android-camera-face-detection-app
View on GitHub
Detecting faces using MLKit
☆10Aug 8, 2019Updated 6 years ago
amachang / system_status_bar_macos
View on GitHub
Library for interacting with the system's status bar for macOS
☆13Apr 28, 2024Updated 2 years ago
Qengineering / GFPGAN-ncnn-Raspberry-Pi-4
View on GitHub
GFPGAN face reconstruction with ncnn on a bare Raspberry Pi
☆14Jan 4, 2023Updated 3 years ago
wangwei2009 / coherence
View on GitHub
dual-mic noise reduction based on coherence function
☆54Dec 10, 2019Updated 6 years ago
thb1314 / ffmpeg-qt-openvino-rtmp
View on GitHub
☆13Dec 28, 2021Updated 4 years ago
kimsunwiub / BLOOM-Net
View on GitHub
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
☆14Feb 13, 2022Updated 4 years ago