Anionex / MiniCPM-o-2.6-int4_Windows_x64_cudaLinks
(整合包Integrated package)一键使用面壁智能最新的MiniCPM-o 2.6多模态模型,用于视频对话、语音对话和文字对话。|Use Modelbest's latest MiniCPM-o 2.6 multi-modal model with one click for video conversations, voice conversations and text conversations within 8g vram.
☆15Updated 5 months ago
Alternatives and similar repositories for MiniCPM-o-2.6-int4_Windows_x64_cuda
Users that are interested in MiniCPM-o-2.6-int4_Windows_x64_cuda are comparing it to the libraries listed below
Sorting:
- FastAPI Server Implementation for Bilibili Index TTS☆26Updated 8 months ago
- 小智机器人服务端☆18Updated 9 months ago
- 异步语音对话组件。☆31Updated 9 months ago
- VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objec…☆167Updated last year
- 一个用于F5-TTS的api和webui项目☆64Updated last year
- ☆58Updated last year
- 一个用于CosyVoice的api接口项目☆327Updated 3 months ago
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆429Updated 11 months ago
- An common framework for voice and text interactions with LLMs☆99Updated last year
- 在DH_live项目基础上修改,添加webui界面☆72Updated 8 months ago
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆89Updated last year
- This project provides a production-ready, real-time inference server for LatentSync, enabling high-quality, low-latency 2D digital human …☆21Updated 4 months ago
- 开源的LstmSync数字人泛化模型,只做最好的泛化模型!☆133Updated this week
- 文本语料转训练集工具,txt转dataset☆93Updated last year
- 洛曦 数字人视频播放器,带HTTP API,使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus、MuseTalk,也可以用于播放本地视频☆175Updated last year
- ChatTTS HTTP API☆54Updated last year
- 基于FunASR官方Demo修改的WS服务端,配合FastAPI提供HTTP服务,可以在浏览器中进行实时ASR测试☆45Updated 4 months ago
- ☆33Updated 10 months ago
- 基于Linly-Talker数字人改版的教育系统,包含网课总结、数字人对话、Chatbot对话,项目可在autodl部署☆34Updated last year
- 封装GPT-Sovits-Interface,可用于用于多角色多情感有声中文小说制作☆35Updated last year
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆104Updated last year
- 低成本的简单基于live2d TTS文字转语音和大模型聊天的直播解决方案☆273Updated last year
- Sample GLM4V + ChatTTS AI assistant☆85Updated last year
- ComfyUI界面汉化 中文简体版☆12Updated last year
- a framework combining abilities of QwenVL and Deepseek Apis to enable a visual interaction using deepseek model.☆113Updated 9 months ago
- 跨语种语音克隆,中文版Webui☆61Updated last year
- 一个语音识别项目☆49Updated 7 months ago
- CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API)☆182Updated 9 months ago
- MaskGCT-Windows For Windows Users☆66Updated 7 months ago
- ☆51Updated last year