lovemefan / Paraformer-webserver
paraformer web server build with sanic
☆24Updated last year
Alternatives and similar repositories for Paraformer-webserver:
Users that are interested in Paraformer-webserver are comparing it to the libraries listed below
- paraformer(chinense asr) online onnx runtime for python☆42Updated last year
- CTC decoder with hotwords for ASR.☆18Updated last week
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆94Updated last year
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆22Updated 11 months ago
- Python Wrapper of Silero VAD☆51Updated this week
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆66Updated 3 weeks ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆90Updated 7 months ago
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆39Updated 6 months ago
- flow mirror models from JZX AI Labs☆45Updated 6 months ago
- Chinese and English Bilinguish G2P☆20Updated last year
- g2p for english tts☆19Updated 2 years ago
- 单独维护的中文TTS☆35Updated 2 years ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆28Updated last year
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆19Updated 2 years ago
- ☆26Updated 2 months ago
- Huawei Grad-TTS for Chinese☆50Updated last year
- noise reduction☆17Updated 9 months ago
- ☆31Updated last month
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Updated 10 months ago
- ☆18Updated 5 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆17Updated last week
- ☆13Updated 2 years ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆93Updated 4 months ago
- ☆33Updated 3 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆96Updated last month
- ☆64Updated last year
- ☆20Updated 6 months ago
- ☆12Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆16Updated last year
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year