lovemefan / fsmn-vad
An enterprise-grade Voice Activity Detector from ModelScope and FunASR.
☆66Updated last year
Related projects
Alternatives and complementary repositories for fsmn-vad
- Paraformer (Chinese ASR) online ONNX runtime for Python☆36Updated 7 months ago
- Huawei Grad-TTS for Chinese☆45Updated last year
- Python Wrapper of Silero VAD☆42Updated 3 weeks ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English☆74Updated this week
- ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR).☆52Updated this week
- End-to-end voice wake-up toolbox, from model training to model inference.☆77Updated 2 months ago
- ☆65Updated last year
- An enterprise-grade Chinese-English code-switching punctuator from FunASR.☆18Updated 6 months ago
- Paraformer web server built with Sanic☆19Updated last year
- Pseudo Streaming SenseVoice with Hotwords☆88Updated 3 weeks ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆44Updated 3 months ago
- WeNet online decode demo☆29Updated 3 years ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆145Updated last month
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆26Updated 8 months ago
- Port of Funasr's Paraformer model in C/C++☆25Updated 5 months ago
- ☆34Updated 3 years ago
- Kaldi-compatible online fbank extractor without external dependencies☆80Updated 3 weeks ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆26Updated 2 weeks ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆84Updated 9 months ago
- Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.☆113Updated last week
- Target Speaker Extraction Toolkit☆113Updated 2 weeks ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆118Updated 3 weeks ago
- ☆82Updated last year
- ASR tutorial: https://dataxujing.github.io/ASR-paper/☆23Updated 4 months ago
- Speech recognition: papers and frontier research☆43Updated 2 years ago
- noise reduction☆17Updated 4 months ago
- Chinese and English bilingual G2P☆20Updated last year
- flow mirror models from JZX AI Labs☆40Updated last month
- ☆30Updated 3 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆61Updated 2 years ago