modelscope / FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
☆4,020Updated 4 months ago
Alternatives and similar repositories for FunClip:
Users that are interested in FunClip are comparing it to the libraries listed below
- 利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.☆3,343Updated this week
- Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文…☆2,785Updated 2 months ago
- 一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…☆6,532Updated last month
- 🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。☆2,212Updated 6 months ago
- 智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能,语言无界”☆2,014Updated 4 months ago
- EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆3,432Updated last month
- ☆1,116Updated 7 months ago
- AI一键批量生成各类短视频,自动批量混剪短视频,自动把视频发布到抖音,快手,小红书,视频号上,赚钱从来没有这么容易过! 支持本地语音模型chatTTS,fasterwhisper,GPTSoVITS,支持云语音:Azure,阿里云,腾讯云。支持Stable diffusio…☆2,877Updated 2 months ago
- 官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project☆1,374Updated 6 months ago
- Real time interactive streaming digital human☆4,326Updated 2 weeks ago
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切…☆9,198Updated last week
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,559Updated 6 months ago
- A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频☆7,876Updated last month
- 快速提取音视频内容,整理成一份结构化的markdown笔记☆1,177Updated 5 months ago
- 一个超轻量级、可以在移动端实时运行的数字人模型☆1,428Updated 2 months ago
- 🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.☆975Updated 2 weeks ago
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆3,291Updated last month
- 基于大模型的智能对话客服工具,支持微信、拼多多、千牛、哔哩哔哩、抖音企业号、抖音、抖店、微博聊天、小红书专业号运营、小红书、知乎等平台接入,可选择 GPT3.5/GPT4.0/ 懒人百宝箱 (后续会支持更多平台),能处理文本、语音和图片,通过插件访问操作系统和互联网等外部资…☆2,626Updated last month
- Multilingual Voice Understanding Model☆4,097Updated last week
- ☆4,101Updated 2 weeks ago
- 自动化上传视频到社交媒体:抖音、小红书、视频号、tiktok、youtube、bilibili☆2,925Updated last week
- 基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks f…☆5,091Updated 2 weeks ago
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …☆1,515Updated 5 months ago
- Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LL…☆2,254Updated last week
- SD变现宝:一键把comfyui工作流转换成小程序。☆1,250Updated last month
- EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆2,328Updated this week
- Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema☆2,119Updated last month
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,293Updated last month
- SoraWebui is an open-source Sora web client, enabling users to easily create videos from text with OpenAI's Sora model.☆2,329Updated 5 months ago
- 自动视频生成器,给定主题,自动生成解说视频。用户输入主题文字,系统调用大语言模型生成故事或解说的文字,然后进一步调用语音合成接口生成解说的语音,调用文生图接口生成契合文字内容的配图,最后融合语音和配图生成解说视频。☆591Updated 2 months ago