zhuzizyf / damo-fsmn-vad-infer-httpserverView external linksLinks
达摩fsmn vad c++推理服务
☆18Apr 17, 2023Updated 2 years ago
Alternatives and similar repositories for damo-fsmn-vad-infer-httpserver
Users that are interested in damo-fsmn-vad-infer-httpserver are comparing it to the libraries listed below
Sorting:
- silero-vad pytorch implement☆34Nov 23, 2024Updated last year
- ☆23Jul 17, 2024Updated last year
- noise reduction☆17Jul 3, 2024Updated last year
- ☆14Jul 11, 2022Updated 3 years ago
- TDY-CNN for text-independent speaker verification☆19Nov 7, 2022Updated 3 years ago
- Working on a dual-microphone noise reduction for mobile phone in noisy environment by Power Level Different Technique (PLD).☆17Jul 25, 2020Updated 5 years ago
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- WeNet 实战课程作业☆20Oct 7, 2022Updated 3 years ago
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆24Aug 21, 2024Updated last year
- ☆33Nov 27, 2021Updated 4 years ago
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 file…☆61Sep 24, 2025Updated 4 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆127Apr 26, 2023Updated 2 years ago
- some ncnn demos of FunASR☆28Sep 23, 2024Updated last year
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 10 months ago
- Tutorial for Ray☆36Mar 31, 2024Updated last year
- simple dnn based vad☆70Dec 2, 2018Updated 7 years ago
- ☆32Sep 14, 2022Updated 3 years ago
- Speech Emotion Recognition using Deep Learning☆12May 24, 2021Updated 4 years ago
- ☆38Oct 14, 2022Updated 3 years ago
- demo for webrtc agc2☆36Dec 25, 2021Updated 4 years ago
- rewrite python scipy.signal.lfilter in c code☆11Aug 13, 2019Updated 6 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 7 months ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- OpenAI Whisper demo on Axera☆14Jan 15, 2026Updated 3 weeks ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 3 months ago
- muslx32 (musl libc and x32 abi) overlay for Gentoo Linux☆10Apr 21, 2021Updated 4 years ago
- c++的一些基础知识总结☆10Oct 28, 2020Updated 5 years ago
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- Most Complete Pytorch Imeplementation "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION"☆10Mar 11, 2020Updated 5 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 7 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- Four neural network architectures to classify sound source direction☆11Oct 3, 2020Updated 5 years ago
- fast_faceswap use dlib and change_style_network(基于dlib和风格迁移网络的快速换脸)☆11Jul 18, 2019Updated 6 years ago
- Unbounded cache model for online language modeling with open vocabulary☆11Feb 15, 2019Updated 6 years ago