HansonJames / general_digital_human_system
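The General Digital Human System is an intelligent interaction platform built on deep learning and WebRTC that integrates Azure Avatar digital-human rendering, speech recognition and synthesis, and natural language processing. It supports real-time conversation, knowledge Q&A, and emotional interaction, with smooth rendering at 30 FPS or higher and low-latency responses within 200 ms. Core features include GPT-based intelligent dialogue, multilingual and dialect support, real-time knowledge-base retrieval, and synchronized facial expressions and emotions. It is developed with the FastAPI and LangChain frameworks, supports one-click Docker deployment, and is suited to scenarios such as intelligent customer service, remote education, and digital exhibition halls. The system provides a complete set of interfaces for secondary development, supporting feature extension and scenario customization.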
☆27Dec 17, 2025Updated last month
Alternatives and similar repositories for general_digital_human_system
Users that are interested in general_digital_human_system are comparing it to the libraries listed below
- This project uses the LLaVA 1.6 multimodal model to implement text-to-image and image-to-image search.☆28Feb 26, 2024Updated last year
- Automatic defect recognition in X-ray testing using computer vision☆12Dec 8, 2018Updated 7 years ago
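- Adds a simple UI and API calling method for indextts2☆20Oct 27, 2025Updated 3 months ago
- One-click private deployment of a WeChat mini program and web operations admin system for queued automatic digital-human training and queued short-video generation; audio-driven lip sync based on single-person training, described as better than lip-sync solutions such as wav2lip, deepfacelab, liveportrait, and musetalk; ready for commercial use; supports voice cloning in multiple languages including Chinese, Japanese, English, and Korean☆56Apr 14, 2025Updated 10 months ago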
- A vote and lottery App for Wechat☆13Dec 16, 2013Updated 12 years ago
- IntelligentScissor implemented in C++.☆12Apr 20, 2018Updated 7 years ago
- ☆10Apr 20, 2019Updated 6 years ago
- ☆12Mar 1, 2023Updated 2 years ago
- Chest Xray Classifier using CNNs and Transfer Learning. The jupyter notebook of interest is titled 'Xrays_alt.ipynb'☆11May 18, 2018Updated 7 years ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Mar 9, 2018Updated 7 years ago
- MobileNet SSD forward-pass detection based on Caffe☆10Nov 30, 2018Updated 7 years ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model☆11Jan 3, 2018Updated 8 years ago
- This project implements the Titans architecture from the paper "Titans: Learning to Memorize at Test Time" for market data prediction.☆11Jan 19, 2025Updated last year
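- Monte Carlo Tree Search (MCTS), implemented in Python☆12Mar 10, 2016Updated 9 years ago
- Face++ is an innovative AI face-reading analysis app developed for the Android platform. It combines traditional Chinese physiognomy theory (such as the "three courts and five eyes" and the "twelve palaces") with modern artificial intelligence to give users a professional, detailed, and insightful face-reading report☆21Jul 14, 2025Updated 7 months ago
- cv-warpPolar-example is a working example of polar coordinate transformation and inverse transformation in OpenCV.☆11Jul 11, 2020Updated 5 years ago
- Built on index-tts-vllm; implements and provides an API service for simulated streaming audio synthesis, along with a client test script☆26Sep 2, 2025Updated 5 months ago
- Infrared thermal imaging with pseudo-color and temperature display☆16Oct 7, 2021Updated 4 years ago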
- ☆83May 20, 2025Updated 8 months ago
- This repository contains my Implementation of hybrid A star for a vehicle with Ackerman steering to perform complex parking maneuvers in …☆13Mar 23, 2023Updated 2 years ago
- Short-video content understanding and recommendation competition☆12Feb 18, 2019Updated 6 years ago
- A model to read multiple analog gauge readings using computer vision☆10Jun 28, 2020Updated 5 years ago
- ☆13Jan 27, 2016Updated 10 years ago
- Concise implementation of image-to-image translation.☆14Jun 19, 2018Updated 7 years ago
- keyframe: a simple tool for selecting keyframes from SfM (structure from motion) videos☆11Jun 27, 2017Updated 8 years ago
- Nonrigid Iterative Closest Point Algorithm☆10Feb 19, 2016Updated 9 years ago
- unofficial implementation of YOLOP TensorRT☆14Dec 11, 2021Updated 4 years ago
- ☆14Apr 18, 2023Updated 2 years ago
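- Deep Learning and TensorFlow in Plain Language (白话深度学习与TensorFlow)☆11May 5, 2018Updated 7 years ago
- Automatic short-video generation, automatic article-to-video conversion, multimodal mixed editing, digital humans, voice cloning☆13Jun 25, 2024Updated last year
- Machine learning datasets and deep learning datasets☆15Aug 8, 2022Updated 3 years ago
- Official code for FRD-UVAD (10-crop version)☆16Nov 2, 2024Updated last year
- Processes the XML files produced by bounding-box annotation in the web version of labelme into a complete VOC2007-format dataset, including images, XML files, and training txt files☆16Aug 2, 2017Updated 8 years ago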
- AlitaNet: A click through rate (ctr) prediction deep learning Network implementation with TensorFlow, including LR, FM, AFM, Wide&Deep, D…☆15Aug 13, 2019Updated 6 years ago
- Shape Context code☆23Aug 31, 2017Updated 8 years ago
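- 📚 A complete feature demo project for the OpenAI API, including: • ChatGPT/GPT-4 chat • DALL-E image generation • Whisper speech conversion • Text embedding search • RAG knowledge-base system • Assistants API applications • Prompt engineering best practices 🔥 Features: •…☆21Nov 10, 2025Updated 3 months ago
- Project by an underwater robotics team☆14Oct 7, 2017Updated 8 years ago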
- Finetuned Deepseek 8b model for finance reasoning☆17Feb 14, 2025Updated last year
- Code for the Broadview book "TensorFlow: 实战Google深度学习框架" (TensorFlow: Google's Deep Learning Framework in Practice)☆16Jul 30, 2024Updated last year