win4r / VideoFinder-Llama3.2-vision-Ollama
VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objects or people within video content. By combining the capabilities of Llama Vision model with a streamlined web interface, it enables real-time, frame-by-frame video analysis with natural language descriptions.
☆23Updated this week
Related projects ⓘ
Alternatives and complementary repositories for VideoFinder-Llama3.2-vision-Ollama
- Using GPT to parse PDF☆68Updated 2 months ago
- ✨🦋 illufly 是自我进化的 Agent 框架: 基于自我进化,快速创造价值☆40Updated this week
- This repo is to use chatTTS and Ollama to create local LLM audio tool.☆19Updated 4 months ago
- Sample GLM4V + ChatTTS AI assistant☆84Updated 5 months ago
- 文本语料转训练集工具,txt转dataset☆77Updated 6 months ago
- LangChain Tutorial 2 实现 AI 女友 Demo☆50Updated last year
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆23Updated last month
- ChatTTS is a generative speech model for daily dialogue.this fork Support ollama☆32Updated 5 months ago
- GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)☆86Updated 2 months ago
- ☆13Updated last month
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆40Updated last week
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆63Updated last month
- Using Groq or OpenAI or Ollama to create o1-like reasoning chains☆233Updated last month
- 与 https://github.com/tonori/mem0ai-api 配合使用的非官方的 mem0ai provider.☆33Updated 3 months ago
- ☆47Updated 4 months ago
- ☆68Updated 10 months ago
- LLM voice chat project by Connect ChatTTS with Local Ollama, 连接本地部署的 Ollama 和 ChatTTS,实现和LLM的语音对话☆56Updated 3 months ago
- ☆42Updated last year
- Real time faster whisper gradio☆25Updated last month
- ☆99Updated 3 months ago
- 基于 Dify 构建的高级搜索工具☆17Updated 2 months ago
- ☆112Updated this week
- TTS☆74Updated 5 months ago
- ☆141Updated 4 months ago
- ☆124Updated this week
- A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处…☆194Updated this week
- You can play any API server that compatible with OpenAI API☆21Updated 5 months ago
- GraphRAG4OpenWebUI integrates Microsoft's GraphRAG technology into Open WebUI, providing a versatile information retrieval API. It combin…☆352Updated 3 months ago
- OpenAI tutorials from the basics to advanced☆45Updated 3 weeks ago
- AI Q&A Search Engine ➡️ 基于LangChain和SearXNG打造的开源AI搜索引擎☆107Updated 2 months ago