nicekate / qwen2.5-vl-demo
☆10Updated 11 months ago
Alternatives and similar repositories for qwen2.5-vl-demo
Users that are interested in qwen2.5-vl-demo are comparing it to the libraries listed below
- SummerAsr is a C++-based local Chinese speech recognizer that compiles standalone with almost no external dependencies. Summer Asr is a Chinese automatic speech recognition project written in C++ that can be eas…☆101Updated last year
- Utilizes ONNX Runtime for speech activity detection.☆38Updated last month
- SenseVoice-python: An enterprise-grade open-source multi-language ASR system from FunASR, running on onnxruntime☆108Updated 3 months ago
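- Reborn as an AI worker. In my previous life I was a nobody, coming and going in a hurry, not knowing where I would be born. Yet fate gave me a rare chance to be reborn as an AI worker.☆50Updated 2 years ago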
- A simple audio denoising tool that provides a web UI and an API☆44Updated last year
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆33Updated 2 years ago
- ☆12Updated last year
- A small project that combines VAD, a voiceprint lock, and SenseVoice for near-real-time speech transcription☆42Updated last year
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆92Updated last month
- An end-to-end voice wake-up toolbox, from model training to model inference.☆149Updated 5 months ago
- For personal use. Speech-to-text uses SenseVoice, the LLM part calls the ollama API, text-to-speech uses cosyvoice, and real-time voice input is based on https://github.com/ABexit/ASR-LLM-TTS.☆12Updated last year
- Utilizes ONNX Runtime for audio denoising.☆107Updated 3 weeks ago
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆121Updated 2 years ago
- This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…☆58Updated 11 months ago
- A standalone integrated WebUI for Bert-vits2 transcription and annotation, integrating Alibaba FunAsr, Bcut ASR, and the Whisper large model☆184Updated last year
- Converts a speech recognition model from PyTorch to ONNX to MNN, with deployment implemented in C++☆82Updated 3 years ago
- Bert-VITS2 ONNX inference version☆44Updated last year
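- This project uses several advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods such as MelSpectrogram, Spectrogram, MFCC, and Fbank☆304Updated last month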
- Utilizes ONNX Runtime for TTS model.☆45Updated this week
- A streaming speech recognition project built around Faster Whisper.☆18Updated last year
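- A FastAPI wrapper for Alibaba's SenseVoice, released as ONNX for a smaller footprint, with a quantized model included and GPU support. Supports speech recognition from files at a URL.☆104Updated last year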
- ChatTTS HTTP API☆54Updated last year
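- A high-performance C++ speech inference framework built on the ONNXRuntime and LLama.cpp inference engines, achieving real-time conversation with RTF<0.7 even on very low-performance edge devices.☆33Updated 3 weeks ago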
- ☆145Updated 2 years ago
- A modular, fully offline-capable, low-resource-usage conversational robot/smart speaker☆126Updated 2 weeks ago
- A simple API version of funasr speech-to-text (funasr + fastapi), easy to deploy on a server☆13Updated last year
- VITS2 for Chinese speech | Latest VITS2 Chinese speech synthesis☆135Updated 2 years ago
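- The officially released Fun-ASR-Nano-2512 repository contains a lot of content and has many deployment pitfalls; this project provides a simplified deployment approach.☆83Updated 3 weeks ago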
- Audio tools for Python☆16Updated last month
- This is a web-based intelligent dialogue program built using ASR, LLM, and TTS.☆24Updated last year