Anionex / MiniCPM-o-2.6-int4_Windows_x64_cudaLinks
(整合包Integrated package)一键使用面壁智能最新的MiniCPM-o 2.6多模态模型,用于视频对话、语音对话和文字对话。|Use Modelbest's latest MiniCPM-o 2.6 multi-modal model with one click for video conversations, voice conversations and text conversations within 8g vram.
☆15Updated 6 months ago
Alternatives and similar repositories for MiniCPM-o-2.6-int4_Windows_x64_cuda
Users that are interested in MiniCPM-o-2.6-int4_Windows_x64_cuda are comparing it to the libraries listed below
Sorting:
- FastAPI Server Implementation for Bilibili Index TTS☆25Updated 9 months ago
- 小智机器人服务端☆18Updated 9 months ago
- 异步语音对话组件。☆32Updated 10 months ago
- VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objec…☆169Updated last year
- 文本语料转训练集工具,txt转dataset☆93Updated last year
- 使用 FastAPI、Streamlit本地部署ChatTTS文本转语音模型,并通过 Docker Compose 进行容器化部署。☆27Updated last year
- 一个用于CosyVoice的api接口项目☆332Updated 4 months ago
- 在DH_live项目基础上修改,添加webui界面☆72Updated 8 months ago
- 一个用于F5-TTS的api和webui项目☆65Updated last year
- 小智同学测试工具(websocket)☆48Updated 10 months ago
- GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能☆181Updated 2 months ago
- ☆58Updated last year
- 洛曦 数字人视频播放器,带HTTP API,使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus、MuseTalk,也可以用于播放本地视频☆175Updated last year
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆430Updated last year
- An common framework for voice and text interactions with LLMs☆99Updated last year
- ChatTTS HTTP API☆54Updated last year
- This project provides a production-ready, real-time inference server for LatentSync, enabling high-quality, low-latency 2D digital human …☆21Updated 5 months ago
- a framework combining abilities of QwenVL and Deepseek Apis to enable a visual interaction using deepseek model.☆113Updated 10 months ago
- CosyVoice语音合成简易API☆14Updated last year
- 基于 faster-whisper 的伪实时语音转写服务☆234Updated 8 months ago
- 基于Linly-Talker数字人改版的教育系统,包含网课总结、数字人对话、Chatbot对话,项目可在autodl部署☆35Updated last year
- Sample GLM4V + ChatTTS AI assistant☆85Updated last year
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆104Updated last year
- ☆51Updated last year
- MaskGCT-Windows For Windows Users☆66Updated 7 months ago
- 如果想体验小智项目,或者开发server端测试的同志,可以使用这个web端damo 体验下。 语音端已经完成,文字端完成,可以语音加文字输出。 等迭代慢慢完善。欢迎PR☆175Updated 7 months ago
- 这是一个 ChatTTS 音频仓库,包含用不同 seed 生成的不同音色,你可以方便地挑选你喜欢的 seed。☆51Updated last year
- 鬼畜视频配音字幕同步项目,基于字幕文件srt同步接入TTS,支持GPT-Sovits ChatTTS BertVits2☆46Updated last year
- 低成本的简单基于live2d TTS文字转语音和大模型聊天的直播解决方案☆274Updated last year
- 封装GPT-Sovits-Interface,可用于用于多角色多情感有声中文小说制作☆36Updated last year