chn-lee-yumi / MaterialSearch
AI语义搜索本地素材。以图搜图、查找本地素材、根据文字描述匹配画面、视频帧搜索、根据画面描述搜索视频。Semantic search. Search local photos and videos through natural language.
☆1,328Updated 2 weeks ago
Alternatives and similar repositories for MaterialSearch:
Users that are interested in MaterialSearch are comparing it to the libraries listed below
- ☆687Updated 9 months ago
- CosyVoice在Windows环境下使用的版本☆648Updated 4 months ago
- 可本地部署的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.☆754Updated this week
- A webui for propainter. Easily pick up objects from the video and eliminate them.☆275Updated 8 months ago
- 本项目意图在于让使用各类语音合成引擎的方式变得统一,支持多种语音合成引擎适配器,允许直接作为模组使用或启动后端服务☆704Updated 11 months ago
- 🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。☆2,349Updated 8 months ago
- 基于Faster-whisper和modelscope一键生成双语字幕,双语字幕生成器,基于离线大模型,Generate bilingual subtitles with one click based on Faster-whisper and modelscope. O…☆374Updated 3 months ago
- ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview☆641Updated 8 months ago
- 🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.☆1,142Updated this week
- 实验ai小说☆295Updated last week
- GAG is a GUI for GPT-SoVITS inference. Just add it to the official integration package and run for a smoother experience.☆65Updated 2 months ago
- 使用ai生成多章节的长篇小说,自动衔接上下文、伏笔☆987Updated last week
- 快速提取音视频内容,整理成一份结构化的markdown笔记☆1,563Updated 8 months ago
- 基于大模型的智能对话客服工具,支持微信、拼多多、千牛、哔哩哔哩、抖音企业号、抖音、抖店、微博聊天、小红书专业号运营、小红书、知乎等平台接入,可选择 GPT3.5/GPT4.0/ 懒人百宝箱 (后续会支持更多平台),能处理文本、语音和图片,通过插件访问操作系统和互联网等外部资…☆2,925Updated 3 months ago
- 轻量、灵活、易上手的Python剪映草稿生成及导出工具,构建全自动化视频剪辑/混剪流水线☆489Updated this week
- AI吟美-人工智能主播-Vtuber☆714Updated 6 months ago
- Inference Specialization☆429Updated 9 months ago
- faster_whisper GUI with PySide6☆2,221Updated 3 months ago
- 一个高自由度的端到端的可定制AI-VTuber。支持对接哔哩哔哩直播间,以智谱API作为语言基座模型,拥有意图识别、长短期记忆(直接记忆和联想记忆),支持搭建认知库、歌曲作品库,接入了当前热门的一些语音转换、语音合成、图像生成、数字人驱动项目,并提供了一个便于操作的客户端。☆381Updated 6 months ago
- 集成主流开源大模型,实现不同类型大模型以及同类型大模型之间的协调合作。☆58Updated last week
- ✨ AsrTools: Smart Voice-to-Text Tool | Efficient Batch Processing | User-Friendly Interface | No GPU Required | Supports SRT/TXT Output |…☆1,969Updated 4 months ago
- 【脱离复杂的环境配置和整合包,极简配置推理服务】从GPT-SoVITS项目里面提取出来的,纯粹的推理服务方案。☆266Updated 11 months ago
- 这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。☆2,296Updated 7 months ago
- an extremely simple tool for separating vocals and background music, completely localized for web operation, using 2stems/4stems/5stems …☆1,500Updated 4 months ago
- 适用于 GPT-SoVITS 的api调用接口☆252Updated last year
- 官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project☆1,566Updated 8 months ago
- vits2 backbone with bert☆338Updated 11 months ago
- 每个人都能用的数字人☆1,229Updated last week
- AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/…☆3,636Updated 3 weeks ago
- 低成本的简单基于live2d TTS文字转语音和大模型聊天的直播解决方案☆244Updated 8 months ago