byjlw / video-analyzerLinks
Analyze videos using LLMs, Computer Vision and Automatic Speech Recognition
☆945Updated 3 months ago
Alternatives and similar repositories for video-analyzer
Users that are interested in video-analyzer are comparing it to the libraries listed below
Sorting:
- A Training-free Iterative Framework for Long Story Visualization☆906Updated 6 months ago
- ☆259Updated 11 months ago
- 实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, su…☆1,027Updated 4 months ago
- Project Page repo of OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication☆368Updated 3 months ago
- The fastest digital human algorithm, now on your desktop.☆536Updated last month
- Easegen is an open-source digital human course creation platform offering comprehensive solutions from course production and video manage…☆237Updated 3 months ago
- ☆1,561Updated this week
- 一个用于CosyVoice的api接口项目☆298Updated 6 months ago
- An common framework for voice and text interactions with LLMs☆94Updated 8 months ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆449Updated 8 months ago
- JoyHallo: Digital human model for Mandarin☆500Updated 8 months ago
- talking-face video editing☆367Updated 4 months ago
- Awesome Digital Human☆1,915Updated last week
- ☆584Updated 9 months ago
- AI virtual human bot framework(public)☆151Updated this week
- ☆584Updated this week
- Open CapCut API.☆396Updated this week
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆367Updated 3 weeks ago
- ☆500Updated 5 months ago
- ☆322Updated 3 weeks ago
- Real time streaming talking head☆481Updated last year
- Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"☆2,927Updated 3 weeks ago
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,174Updated 3 months ago
- AI-Powered Watermark Remover using Florence-2 and LaMA Models: A Python application leveraging state-of-the-art deep learning models to e…☆644Updated 2 months ago
- project page for ChatAnyone☆111Updated 3 months ago
- CosyVoice在苹果MacOs上使用的版本☆135Updated 10 months ago
- 快速提取音视频内容,整理成一份结构化的markdown笔记☆1,807Updated last year
- VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objec…☆156Updated 8 months ago
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆401Updated 6 months ago
- ☆178Updated 5 months ago