JKay0327 / whisper-TPU_pyLinks
A whisper repo for TPU
☆10Updated last year
Alternatives and similar repositories for whisper-TPU_py
Users that are interested in whisper-TPU_py are comparing it to the libraries listed below
Sorting:
- 适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆13Updated last year
- ChatTTS is a generative speech model for daily dialogue.☆14Updated 9 months ago
- Text2speech & tone color conversion demo running on SG2300x 结合openvoice和emotivoice的TTS+即时克隆☆22Updated 9 months ago
- Stable Diffusion+LCM在SG2300X上,纵享丝滑一秒出图☆18Updated 8 months ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆124Updated this week
- ☆59Updated last year
- flow mirror models from JZX AI Labs☆44Updated 10 months ago
- simplify >2GB large onnx model☆61Updated 8 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆97Updated 10 months ago
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆42Updated 10 months ago
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆217Updated 7 months ago
- run chatglm3-6b in BM1684X☆40Updated last year
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆25Updated last year
- Pseudo Streaming SenseVoice with Hotwords☆332Updated 5 months ago
- ☆201Updated 10 months ago
- 使用vllm加速cosyvoice2的推理☆386Updated 3 months ago
- paraformer(chinense asr) online onnx runtime for python☆50Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆107Updated 2 years ago
- export llama to onnx☆131Updated 7 months ago
- Run generative AI models in sophgo BM1684X/BM1688☆232Updated this week
- Bert-VITS2项目bug多且教程不友好。本proj尽可能修复了Bert-vits2项目的bug,并且可一键启动训练。仅需50条目标说话人语音,获得稳定、快速的TTS模型。☆63Updated 5 months ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆75Updated 3 weeks ago
- ASR client for Triton ASR Service☆32Updated 8 months ago
- 基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏☆267Updated last year
- F5-TTS 推理加速,速度提升约4倍!☆104Updated 7 months ago
- ☆132Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++☆33Updated last year
- 本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法☆281Updated 2 months ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆393Updated last year