byjlw / video-analyzerLinks
Analyze videos using LLMs, Computer Vision and Automatic Speech Recognition
☆1,035Updated 5 months ago
Alternatives and similar repositories for video-analyzer
Users that are interested in video-analyzer are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication☆381Updated last week
- ☆276Updated last year
- A Training-free Iterative Framework for Long Story Visualization☆918Updated 8 months ago
- 实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, su…☆1,102Updated 6 months ago
- The fastest digital human algorithm, now on your desktop.☆550Updated 3 months ago
- AutoClip: AI-powered video clipping and highlight generation · 一款智能高光提取与剪辑的二创工具☆840Updated last week
- JoyHallo: Digital human model for Mandarin☆506Updated this week
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆456Updated 10 months ago
- 一个用于CosyVoice的api接口项目☆309Updated 3 weeks ago
- Easegen is an open-source digital human course creation platform offering comprehensive solutions from course production and video manage…☆242Updated 5 months ago
- Open CapCut API.☆1,015Updated this week
- RTC AIGC Demo☆204Updated 2 months ago
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,430Updated this week
- ☆621Updated 2 months ago
- talking-face video editing☆378Updated 7 months ago
- VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objec…☆162Updated 10 months ago
- ☆2,497Updated this week
- AI-Powered Video Retrieval & Clipping Tool☆340Updated last month
- Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"☆3,041Updated 3 months ago
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆1,723Updated last month
- ☆362Updated 2 months ago
- CosyVoice在苹果MacOs上使用的版本☆142Updated last year
- gradio WebUI for AdvancedLivePortrait☆511Updated 6 months ago
- [SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head☆775Updated 2 weeks ago
- MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.☆458Updated last month
- 快速提取音视频内容,整理成一份结构化的markdown笔记☆1,872Updated last year
- MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting…☆965Updated last week
- Awesome Digital Human☆2,060Updated last month
- ReMe: Memory Management Framework for Agents - Remember Me, Refine Me.☆537Updated last week
- Diffusion-based Portrait and Animal Animation☆833Updated this week