KylinMountain / transvideoLinks
使用大语言模型自动翻译视频字幕,并采用反思策略优化字幕,最后通过chattts合成语音并合并到原视频中。
☆10Updated 11 months ago
Alternatives and similar repositories for transvideo
Users that are interested in transvideo are comparing it to the libraries listed below
Sorting:
- 基于大模型生成内容的智能语音对讲☆10Updated 8 months ago
- agentcp是一个基于ACP协议的Agent sdk,用于解决Agent间的身份认证及通信问题;用于创建AID、连接入网、构建会话,收发消息等;支持多Agent协作,异步消息处理,支持内网穿透,支持Agent访问的负载均衡☆11Updated last week
- 基于 Dify + Langfuse 的自动化评估服务☆70Updated last month
- LLM智能路由网关、 Enterprise Intelligent AI-API Distribution Gateway☆12Updated 5 months ago
- Examples for QinYan GLMs☆13Updated 10 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated 10 months ago
- 本项目系列相关视频为大家测试CrewAI官方提供的Tools☆11Updated 6 months ago
- 如何让 dify工作流的 code 节点拿到图片的信息☆24Updated 4 months ago
- ☆23Updated 3 months ago
- mcp的webui界面,支持客户端连接多个sse服务端,支持 openai、deepseek、qwen等大模型,另外附上构建的 agent的 stdio和sse的简单 天气查询的完整示例☆32Updated last month
- 对接 Dify不同应用的 API,从而对接 自己的业务系统,实现与 Dify 应用的对话流处理,将对话结果流式返回给前端,并将对话结果分发给开发者自行处理☆11Updated 10 months ago
- ☆57Updated 8 months ago
- 基于FastAPI的语音服务系统,集成语音合成(TTS)和语音识别(STT)功能。使用CosyVoice2作为TTS引擎,FunASR作为STT引擎,支持零样本语音克隆、流式输出、多种语言识别等高级功能。☆11Updated 3 months ago
- zlai☆22Updated 9 months ago
- MinerU API server☆65Updated 7 months ago
- dify DSL files collections☆31Updated 11 months ago
- GPT+神器,简单实用的一站式AGI架构,内置本地化,LLM模型,agent,矢量数据库,智能链chain☆48Updated 2 years ago
- ☆19Updated 10 months ago
- Text2Neo4j 是一个遍历文档、从文本中提取关系并将其保存到 Neo4j 数据库中以形成知识图谱的工具。本项目结合了 Dify 和 LLaMA3.1(8B 模型)来高效处理和提取复杂关系。☆19Updated 10 months ago
- A powerful task-oriented dialogue agent that can collect information through structured conversations. It supports dynamic field validati…☆33Updated 2 months ago
- 01. Enabling various applications to be AI-enabled or used by AI.☆26Updated last year
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆118Updated 3 months ago
- Dive into LLM Agents☆18Updated last year
- ☆33Updated last year
- 视频理解:千问视频多模态模型 & Dify☆60Updated 10 months ago
- 02. Enabling various applications to be AI-enabled or used by AI.☆29Updated 10 months ago
- Fine-tuning embedding models.☆13Updated 7 months ago
- ☆254Updated 6 months ago
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆67Updated 10 months ago
- 使用 FastAPI、Streamlit本地部署ChatTTS文本转语音模型,并通过 Docker Compose 进行容器化部署。☆27Updated 9 months ago