win4r / VideoFinder-Llama3.2-vision-Ollama
VideoFinder is a video analysis tool powered by multimodal AI, designed to help users locate and identify specific objects or people within video content. By combining the capabilities of the Llama Vision model with a streamlined web interface, it enables real-time, frame-by-frame video analysis with natural language descriptions.
☆119 Updated 2 months ago
Alternatives and similar repositories for VideoFinder-Llama3.2-vision-Ollama:
Users who are interested in VideoFinder-Llama3.2-vision-Ollama are comparing it to the repositories listed below.
- Sample GLM4V + ChatTTS AI assistant ☆85 Updated 7 months ago
- Tool for converting text corpora into training sets (txt to dataset) ☆82 Updated 8 months ago
- ☆172 Updated last month
- Merged edition of GraphRAG-Ollama-UI + GraphRAG4OpenWebUI (with a Gradio web UI for configuring and generating RAG indexes, and FastAPI serving a RAG API) ☆92 Updated 5 months ago
- Cross-platform, open-source Chinese NoteBookLM app based on Electron, using DeepSeek + Reecho.ai ☆55 Updated 2 months ago
- ChatTTS is a generative speech model for daily dialogue. This fork supports Ollama. ☆34 Updated 8 months ago
- ☆136 Updated this week
- TTS ☆77 Updated 8 months ago
- Using GPT to parse PDF ☆84 Updated 4 months ago
- An API interface project for CosyVoice ☆157 Updated last week
- A code executor for Dify that is compatible with the official sandbox API calls and dependency installation. ☆83 Updated last month
- Prepared for AI带路党Pro videos ☆99 Updated 3 months ago
- A collection of DSL workflow scripts built on the open-source Dify project ☆49 Updated this week
- Adds a 🚀 streaming web server to GraphRAG: compatible with the OpenAI SDK, supports accessible entity links 🔗 and suggested questions, works with local embedding models, and fixes many issues. ☆225 Updated 2 weeks ago
- An open-source agentic workflow framework/showcase that turns chat text into control actions, powered by the Agently AI application development framework. ☆26 Updated 4 months ago
- ChatTTS HTTP API ☆51 Updated 7 months ago
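- Self-built AI application DSL workflows based on Dify, available for free; whether for personal needs or for learning, they can start you on an intelligent journey full of possibilities. ☆141 Updated 3 weeks ago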
- Easegen is an open-source digital human course creation platform offering comprehensive solutions from course production and video manage… ☆172 Updated this week
- This repo uses ChatTTS and Ollama to create a local LLM audio tool. ☆20 Updated 6 months ago
- An unofficial mem0ai provider for use with https://github.com/tonori/mem0ai-api ☆47 Updated 6 months ago
- An LLM RAG (retrieval-augmented generation) system that runs on your laptop. It deploys easily on a laptop and provides intelligent Q&A over a local knowledge base. ☆126 Updated last month
- Fine-tuning and training the Qwen2 large model with the Unsloth framework on native Windows (not WSL) ☆46 Updated 7 months ago
- A common framework for voice and text interactions with LLMs ☆85 Updated 2 months ago
- ☆58 Updated 3 months ago
- Generate PPT with an LLM ☆77 Updated 11 months ago
- Dive into LLM Agents ☆17 Updated 7 months ago
- A function-calling tool that can be deployed to Cloudflare Workers with an OpenAPI schema ☆72 Updated 6 months ago
- 百聆 is a GPT-4o-like voice conversation bot built with ASR+LLM+TTS, with latency as low as 800 ms; it runs on low-end hardware and supports interruption ☆432 Updated last week
- 🔥 Turn entire websites into LLM-ready markdown ☆65 Updated 8 months ago
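- A FastAPI wrapper for Alibaba's SenseVoice, released as ONNX for a smaller footprint, with a quantized model included and GPU support. Supports speech recognition from files at a URL. ☆51 Updated 4 months ago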