Henry-23 / VideoChat
Real-time voice-interactive digital human, supporting an end-to-end voice pipeline (GLM-4-Voice - THG) and a cascaded pipeline (ASR-LLM-TTS-THG). Appearance and voice are customizable with no training required, voice cloning is supported, and first-packet latency is as low as 3s.
☆268Updated this week
Related projects
Alternatives and complementary repositories for VideoChat
- The fastest digital human algorithm, now on your desktop.☆256Updated this week
- ☆112Updated this week
- ☆124Updated this week
- JoyHallo: Digital human model for Mandarin☆290Updated last month
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆359Updated this week
- ☆302Updated 3 weeks ago
- gradio WebUI for AdvancedLivePortrait☆260Updated this week
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code☆405Updated 3 weeks ago
- Bring portraits to life via Monitor!☆255Updated 3 months ago
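- Real-time STT connected to the OpenAI API / Zhipu AI (streaming LLM) and GPT-SOVITS / Edge-TTS, with cross-network service calls made through a web page to achieve real-time conversation☆248Updated 4 months ago
- Video-to-video translation with voice cloning and lip synchronization; supports translation between Chinese and English☆109Updated 6 months ago
- EZ-Work AI document translation: an open-source AI document translation assistant anyone can use. It calls the APIs of large language models such as OpenAI quickly and at low cost to translate txt/markdown/word/csv/excel/pdf/ppt documents.☆122Updated this week
- An ultra-lightweight digital human model that runs in real time on mobile devices☆858Updated this week
- 洛曦 (Luoxi) digital human video player with an HTTP API; uses the gradio API to connect to Easy-Wav2Lip, Sadtalker, GeneFacePlusPlus, and MuseTalk, and can also play local videos☆144Updated 3 weeks ago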
- A multilingual AI model evaluation platform built with Next.js, supporting multi-model comparison and real-time streaming responses.☆70Updated 3 weeks ago
- ☆99Updated last week
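- 10,000 chatTTS voices! A chatTTS voice library, so you no longer have to gamble on random voices. This is my first project; I stayed up late slowly generating 10,000 voices and uploading them to GitHub, so a little encouragement would be appreciated! Main domain: https://www.TTSlist.com Backup: http://ttslist.aiqb…☆139Updated 3 months ago
- An API project for CosyVoice☆78Updated 3 weeks ago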
- TianMu: A modern AI tool with multi-platform support, markdown support, multimodal, continuous conversation, and customizable commands. 一…☆84Updated last year
- 📝 Extracts content from document images and reproduces it one-to-one in Word or Txt for further use or processing. Planned: PDF/image input with output in JSON, Txt, Word, and Markdown formats.☆149Updated last week
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆42Updated 2 months ago
- Awesome Digital Human☆894Updated last week
- ☆487Updated 2 weeks ago
- ☆286Updated 3 months ago
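- Records key milestones in the development of AI technologies such as text-to-image, text-to-video, and large language models☆68Updated 2 months ago
- In development: connects Dify to a Feishu bot☆86Updated 4 months ago
- 😜 A meme visual dataset, annotated using the image understanding capabilities of glm-4v and step-1v.☆98Updated 6 months ago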
- ☆166Updated last month
- Real-time streaming talking head☆437Updated 5 months ago
- ☆116Updated 2 months ago