win4r / VideoFinder-Llama3.2-vision-OllamaView external linksLinks
VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objects or people within video content. By combining the capabilities of Llama Vision model with a streamlined web interface, it enables real-time, frame-by-frame video analysis with natural language descriptions.
☆169Nov 8, 2024Updated last year
Alternatives and similar repositories for VideoFinder-Llama3.2-vision-Ollama
Users that are interested in VideoFinder-Llama3.2-vision-Ollama are comparing it to the libraries listed below
Sorting:
- ☆10Oct 23, 2024Updated last year
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- ☆12Nov 21, 2025Updated 2 months ago
- xclabel是一款支持多人协作的,样本导入+样本标注+模型训练+模型管理+模型测试+模型导出的工具☆12Mar 11, 2025Updated 11 months ago
- AI Search engine☆13Sep 24, 2025Updated 4 months ago
- This repository is an offical PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.☆13Mar 18, 2024Updated last year
- 基于youtube、bilibili等视频平台、webpage网页等,利用零一万物大模型或ollama本地小模型构建大语言模型高质量训练数据集(计划支持可自定义输出的训练数据格式)☆19May 2, 2024Updated last year
- MultiModal RAG using Qwen 2 VL and Colpali.☆20Sep 28, 2024Updated last year
- ☆14May 21, 2024Updated last year
- ☆15Apr 28, 2023Updated 2 years ago
- AI_Video_Shorts_Creator is a python-based tool that uses OpenAI's GPT-4 power to automatically analyze videos, extract the most interesti…☆18Sep 22, 2023Updated 2 years ago
- 基于vllm部署qwen2.5_vl实现视频流的实时识别☆20Apr 1, 2025Updated 10 months ago
- tensorflow label tool 快速图像标注工具☆20Jun 20, 2018Updated 7 years ago
- 在人工智能时代,AI凭借其强大的任务处理能力,逐渐成为不同岗位工作人员手中的得力工具。但对于视频解析的AI因为种种原因,效果并不理想 本项目是用Python制作的一个能够调用qwen-vl的程序,通过ffmpeg对视频逐帧切分后调用qwen-vl实现对视频的解析☆51Jan 18, 2026Updated 3 weeks ago
- Recognize faces and objects in the video based on Milvus.☆22Aug 10, 2021Updated 4 years ago
- 基于 Rabbitmq 的 Http 异步消息调用服务☆42Jan 7, 2023Updated 3 years ago
- A SDK to using the Realtime API with Microcontrollers like the ESP32☆23Apr 13, 2025Updated 10 months ago
- 本项目的小智适配版本☆24Mar 2, 2025Updated 11 months ago
- GraphRAG在2024.11.5发布 0.4.0新版本,引入增量更新索引和DRIFT图推理搜索查询,本项目对新增的两个新功能进行全面测试,并提供了一种支持多类型大模型使用GraphRAG解决方案,不仅支持GPT大模型,还支持本地大模型(Ollama)、阿里云通义千问、百…☆48Dec 1, 2024Updated last year
- 这是一个用于连接小智AI服务的Python客户端库。它提供了简单的接口来进行语音对话和文本交互。☆26Mar 14, 2025Updated 11 months ago
- 这是一个基于FastAPI的智能视频识别系统,集成了Ollama大模型,能够实时处理RTSP视频流并提供AI驱动的内容识别功能。系统采用现代化的Web界面设计,支持多终端访问,为视频监控和内容分析提供了强大的解决方案。☆37Jun 17, 2025Updated 8 months ago
- A Next.js version of Claude Aritfacts , inspired by llamacoder☆27Sep 26, 2024Updated last year
- Aila(AI超元域): The premier AI integration tool for Windows, macOS, and Android. Ask once, get answers from 10+ AIs like ChatGPT, Gemini, Cl…☆1,812Dec 25, 2025Updated last month
- Using Groq or OpenAI or Ollama to create o1-like reasoning chains☆290Sep 17, 2024Updated last year
- 实现使用开源的LangFlow框架,零代码实现大模型相关应用如流量包推荐智能客服、RAG应用等,并使用两种方式将创建的工作流集成到自己的项目中☆31Sep 9, 2024Updated last year
- 立创·实战派ESP32-C3 开发板 学习实践☆28Oct 30, 2024Updated last year
- 这是一个基于 `PyQt5` 和 `Python` 的网络信息抓取工具,可自动从互联网搜索引擎中抓取与关键词相关的内容,并将结果保存至本地文件,同时支持文本复制到剪贴板。支持的搜索引擎包括 Google, Bing, Baidu, 和 Sogou。This is a web…☆25May 20, 2024Updated last year
- Build AI-powered applications with React, Svelte, Vue, and Solid☆67Nov 15, 2024Updated last year
- 安防视频云平台,输入支持各种格式(rtsp、gb28181、sdk视频图片等),输出统一为视频流和图片流,对接算法平台(人脸识别、车牌识别、目标检测等),支持报警事件联动,预览输出支持hls和http-flv,所有服务器集群运行,高并发☆24Sep 24, 2020Updated 5 years ago
- 🛍 A full E-commerce app with nice UI consists of on-boarding, login, sign-up, home, product details, cart and user profile.☆10Sep 8, 2024Updated last year
- ☆23Jan 16, 2024Updated 2 years ago
- ☆28Jan 10, 2025Updated last year
- 小智的视觉对话☆32Apr 25, 2025Updated 9 months ago
- 小智AI MCP功能扩充,包含Onvif摄像头画面截取识别,PC软件控制等☆122Oct 27, 2025Updated 3 months ago
- 本项目为xiaozhi-esp32提供C++后端服务,帮助您快速搭建ESP32设备控制服务器。Backend C++ service for xiaozhi-esp32, helps you quickly build an ESP32 device control ser…☆32Apr 21, 2025Updated 9 months ago
- A Framework for Narrative Agents☆37Sep 24, 2024Updated last year
- A Model Context Protocol (MCP) server that provides JSON-RPC functionality through OpenRPC.☆43Apr 23, 2025Updated 9 months ago
- Web UI for OpenAvatarChat☆59Aug 28, 2025Updated 5 months ago
- You can use SHMT method to apply makeup to the characters when use ComfyUI☆30Jan 9, 2025Updated last year