peilongchencc / My-FunASRLinks
基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。
☆48Updated last year
Alternatives and similar repositories for My-FunASR
Users that are interested in My-FunASR are comparing it to the libraries listed below
Sorting:
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆108Updated 4 months ago
- ☆69Updated last year
- ☆204Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆127Updated 2 years ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆152Updated 6 months ago
- IndexTTS Fine-tuning notebooks☆132Updated 7 months ago
- low-latency realtime ASR based on FireRedASR☆57Updated 7 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆183Updated 7 months ago
- TTS appalication based on modelscope KAN-TTS☆41Updated last year
- 中文标点符号模型,可以给文本添加标点符号。☆147Updated last year
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆83Updated 9 months ago
- Bert-VITS2项目bug多且教程不友好。本proj尽可能修复了Bert-vits2项目的bug,并且可一键启动训练。仅需50条目标说话人语音,获得稳定、快速的TTS模型。☆67Updated 5 months ago
- flow mirror models from JZX AI Labs☆43Updated last year
- 使用vllm加速cosyvoice2的推理☆481Updated 9 months ago
- Pseudo Streaming SenseVoice with Hotwords☆426Updated 10 months ago
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆219Updated last year
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆30Updated last year
- ☆149Updated 2 years ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆20Updated last week
- 用 OCR 提取视频硬字幕☆88Updated 3 weeks ago
- paraformer(chinense asr) online onnx runtime for python☆53Updated last year
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆83Updated 3 years ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆172Updated last year
- Utilizes ONNX Runtime to transcribe audio into text.☆80Updated this week
- F5-TTS 推理加速,速度提升约4倍!☆122Updated last year
- 修复funasr中seaco-paraformer导出onnx后没有时间戳的bug☆24Updated last year
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆69Updated 3 weeks ago
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆312Updated last month
- Utilizes ONNX Runtime for speech activity detection.☆41Updated 2 months ago
- Python Wrapper of Silero VAD☆64Updated 9 months ago