Henry-23 / VideoChat
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.
☆413Updated this week
Related projects ⓘ
Alternatives and complementary repositories for VideoChat
- The fastest digital human algorithm, now on your desktop.☆296Updated this week
- JoyHallo: Digital human model for Mandarin☆338Updated this week
- 一个超轻量级、可以在移动端实时运行的数字人模型☆1,002Updated last week
- ☆126Updated this week
- Real time streaming talking head☆440Updated 6 months ago
- ☆116Updated last week
- Awesome Digital Human☆931Updated last week
- 🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.☆854Updated last week
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆254Updated 4 months ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆363Updated 2 weeks ago
- ☆499Updated 3 weeks ago
- 每个人都能用的数字人☆700Updated 2 weeks ago
- 洛曦 数字人视频播放器,带HTTP API,使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus、MuseTalk,也可以用于播放本地视频☆145Updated last month
- An open-source LLM based automatically daily news collecting workflow showcase powered by Agently AI application development framework.☆443Updated 3 weeks ago
- Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion I…☆314Updated 3 weeks ago
- 10000 chatTTS voices !chatTTS 音色库,再也不为音色抽卡烦恼啦。这是我第一个项目,熬夜龟速生产10000条音色并上传Github,给点鼓励呗哈!主域名:https://www.TTSlist.com 备用:http://ttslist.aiqb…☆140Updated 4 months ago
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code☆424Updated last month
- ☆307Updated last month
- ChatPilot: Chat Agent Web UI,实现Chat对话前端,支持Google搜索、文件网址对话(RAG)、代码解释器功能,复现了Kimi Chat(文件,拖进来;网址,发出来)。☆510Updated last week
- ☆287Updated 3 months ago
- An open-source AI content search engine designed specifically for content creators. Supports extraction of text, images, and short videos…☆503Updated 4 months ago
- ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview☆534Updated 4 months ago
- 自动视频生成器,给定主题,自动生成解说视频。用户输入主题文字,系统调用大语言模型生成故事或解说的文字,然后进一步调用语音合成接口生成解说的语音,调用文生图接口生成 契合文字内容的配图,最后融合语音和配图生成解说视频。☆524Updated last week
- 实时互动的GPT数字人☆293Updated 2 weeks ago
- ☆1,045Updated 5 months ago
- AI吟美-人工智能主播-Vtuber☆584Updated 2 months ago
- 最快油管英文视频转中文☆280Updated 4 months ago
- 数字人资料整理☆460Updated 2 weeks ago
- EZ-Work AI文档翻译,人人可用的开源AI文档翻译助手,可以快速低成本调用OpenAI等大语言模型api,帮助您实现txt/markdown/word/csv/excel/pdf/ppt的文档翻译。☆128Updated this week
- WebDesignAgent : Towards Effortless Website Creation☆239Updated 2 months ago