JKay0327 / whisper-TPU_pyLinks
A whisper repo for TPU
☆10Updated last year
Alternatives and similar repositories for whisper-TPU_py
Users that are interested in whisper-TPU_py are comparing it to the libraries listed below
Sorting:
- 适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆14Updated last year
- Text2speech & tone color conversion demo running on SG2300x 结合openvoice和emotivoice的TTS+即时克隆☆22Updated 11 months ago
- ChatTTS is a generative speech model for daily dialogue.☆14Updated last year
- run ChatGLM2-6B in BM1684X☆50Updated last year
- qwen2 and llama3 cpp implementation☆47Updated last year
- Stable Diffusion+LCM在SG2300X上,纵享丝滑一秒出图☆18Updated 10 months ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆139Updated 2 months ago
- simplify >2GB large onnx model☆63Updated 10 months ago
- export llama to onnx☆136Updated 9 months ago
- ☆64Updated last year
- flow mirror models from JZX AI Labs☆44Updated last year
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆45Updated last year
- run chatglm3-6b in BM1684X☆40Updated last year
- Run generative AI models in sophgo BM1684X/BM1688☆251Updated this week
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆104Updated 3 weeks ago
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆217Updated 9 months ago
- ☆124Updated last year
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆76Updated 3 years ago
- ☆204Updated last year
- ☆138Updated 2 years ago
- Pseudo Streaming SenseVoice with Hotwords☆366Updated 7 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆114Updated 2 years ago
- paraformer(chinense asr) online onnx runtime for python☆53Updated last year
- 使用SG2300X实现无瑕疵换脸☆32Updated last year
- 使用vllm加速cosyvoice2的推理☆430Updated 6 months ago
- llm-export can export llm model to onnx.☆314Updated last month
- ☆72Updated 2 years ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆50Updated 2 years ago
- Bert-VITS2项目bug多且教程不友好。本proj尽可能修复了Bert-vits2项目的bug,并且可一键启动训练。仅需50条目标说话人语音,获得稳定、快速的TTS模型。☆65Updated 2 months ago
- 本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法☆292Updated 4 months ago