DAMO FSMN VAD C++ inference service
☆18Apr 17, 2023Updated 2 years ago
Alternatives and similar repositories for damo-fsmn-vad-infer-httpserver
Users that are interested in damo-fsmn-vad-infer-httpserver are comparing it to the libraries listed below
- silero-vad PyTorch implementation☆36Nov 23, 2024Updated last year
- ☆23Jul 17, 2024Updated last year
- noise reduction☆17Jul 3, 2024Updated last year
- ☆15Jul 11, 2022Updated 3 years ago
- Dual-microphone noise reduction for mobile phones in noisy environments using the Power Level Difference (PLD) technique.☆17Jul 25, 2020Updated 5 years ago
- TDY-CNN for text-independent speaker verification☆19Nov 7, 2022Updated 3 years ago
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- WeNet hands-on course assignments☆20Oct 7, 2022Updated 3 years ago
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆24Aug 21, 2024Updated last year
- ☆33Nov 27, 2021Updated 4 years ago
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 file…☆61Sep 24, 2025Updated 5 months ago
- some ncnn demos of FunASR☆28Sep 23, 2024Updated last year
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆129Apr 26, 2023Updated 2 years ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated 2 years ago
- Implementation of the Links Online Clustering algorithm: https://arxiv.org/abs/1801.10123☆30Oct 9, 2021Updated 4 years ago
- (WIP) long-form speech generation☆31Apr 2, 2025Updated 11 months ago
- Tutorial for Ray☆36Mar 31, 2024Updated last year
- Simple DNN-based VAD☆70Dec 2, 2018Updated 7 years ago
- ☆32Sep 14, 2022Updated 3 years ago
- Speech Emotion Recognition using Deep Learning☆12May 24, 2021Updated 4 years ago
- ☆39Oct 14, 2022Updated 3 years ago
- demo for webrtc agc2☆36Dec 25, 2021Updated 4 years ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆27Feb 13, 2026Updated 3 weeks ago
- Rewrite of Python's scipy.signal.lfilter in C☆11Aug 13, 2019Updated 6 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆105Jan 10, 2025Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 8 months ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- A summary of some C++ fundamentals☆10Oct 28, 2020Updated 5 years ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- Open-source Mandarin biased word dataset☆14Sep 21, 2023Updated 2 years ago
- Most complete PyTorch implementation of "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION"☆10Mar 11, 2020Updated 5 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- The source code for the real-time video descriptors: HOG, HOF, MBH and HMG☆10Feb 6, 2017Updated 9 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- fast_faceswap uses dlib and change_style_network (fast face swapping based on dlib and a style transfer network)☆11Jul 18, 2019Updated 6 years ago
- ☆10Nov 1, 2018Updated 7 years ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 4 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year