lovemefan / fsmn-vad
An enterprise-grade Voice Activity Detector from modelscope and funasr.
☆59Updated last year
Related projects
Alternatives and complementary repositories for fsmn-vad
- Python Wrapper of Silero VAD☆41Updated last week
- Huawei Grad-TTS for Chinese☆45Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English☆71Updated this week
- End-to-end voice wake-up toolbox, from model training to model inference.☆73Updated 2 months ago
- ☆65Updated last year
- Paraformer (Chinese ASR) online ONNX Runtime for Python☆35Updated 7 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆43Updated 2 months ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆52Updated 2 months ago
- Pseudo Streaming SenseVoice with Hotwords☆73Updated last week
- Kaldi-compatible online fbank extractor without external dependencies☆78Updated 2 weeks ago
- WeNet online decode demo☆29Updated 3 years ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆128Updated 3 weeks ago
- ☆80Updated last year
- noise reduction☆17Updated 4 months ago
- Speech recognition: papers and frontiers☆43Updated 2 years ago
- Paraformer web server built with Sanic☆19Updated last year
- ASR tutorial: https://dataxujing.github.io/ASR-paper/☆23Updated 4 months ago
- Port of Funasr's Paraformer model in C/C++☆25Updated 4 months ago
- A library for adding punctuation into a text from ASR.☆17Updated last year
- An enterprise-grade Chinese-English code-switching punctuator from funasr.☆17Updated 6 months ago
- Chinese and English Bilingual G2P☆20Updated last year
- ☆30Updated 3 years ago
- Target Speaker Extraction Toolkit☆105Updated this week
- DAMO FSMN VAD C++ inference service☆11Updated last year
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆114Updated last week
- A CTC decoder for both online and offline ASR models☆58Updated 11 months ago
- ☆34Updated 3 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆83Updated 9 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated last year