win4r / VideoFinder-Llama3.2-vision-Ollama
VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objects or people within video content. By combining the capabilities of Llama Vision model with a streamlined web interface, it enables real-time, frame-by-frame video analysis with natural language descriptions.
☆145Updated 6 months ago
Alternatives and similar repositories for VideoFinder-Llama3.2-vision-Ollama
Users that are interested in VideoFinder-Llama3.2-vision-Ollama are comparing it to the libraries listed below
Sorting:
- 文本语料转训练集工具,txt转dataset☆92Updated last year
- An common framework for voice and text interactions with LLMs☆93Updated 6 months ago
- GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)☆110Updated 8 months ago
- ChatTTS HTTP API☆53Updated 11 months ago
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆54Updated 3 months ago
- 使用 FastAPI、Streamlit本地部署ChatTTS文本转语音模型,并通过 Docker Compose 进行容器化部署。☆25Updated 7 months ago
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆81Updated 8 months ago
- xclabel是一款支持多人协作的,样本导入+样本标注+模型训练+模型管理+模型测试+模型导出的工具☆129Updated 2 months ago
- Sample GLM4V + ChatTTS AI assistant☆84Updated 11 months ago
- 视频理解:千问视频多模态模型 & Dify☆53Updated 8 months ago
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆377Updated 4 months ago
- ☆148Updated 10 months ago
- Using GPT to parse PDF☆97Updated 8 months ago
- 基于 faster-whisper 的伪实时语音转写服务☆212Updated 2 weeks ago
- ragflow中的ocr部分,非官方项目☆40Updated 8 months ago
- AutoGen最新架构v0.4正式发布第一个稳定版本,v0.4是对AutoGen的一次从头开始的重写,目的是为构建Agent创建一个更健壮、可扩展、更易用的跨语言库,其应用接口采用分层架构设计,存在多套软件接口用以满足不同的场景需求 。☆103Updated last month
- XAgent 教程☆36Updated last year
- 实现使用开源的LangFlow框架,零代码实现大模型相关应用如流量包推荐智能客服、RAG应用等,并使用两种方式将创建的工作流集成到自己的项目中☆23Updated 8 months ago
- LLM voice chat project by Connect ChatTTS with Local Ollama, 连接本地部署的 Ollama 和 ChatTTS,实现和LLM的语音对话☆62Updated 9 months ago
- ☆58Updated 6 months ago
- 通过装饰器将函数接入OpenAI的Chat☆46Updated 2 months ago
- ☆246Updated 4 months ago
- virtualwife-llm-factory 是一个llm训练框架,用于解决虚拟角色训练入门门槛高的问题,该框架具备自动生成语料,性格塑造评估,基于国产llm微调训练等核心能力,目前还在开发,可以点个star~ 关注一下☆47Updated 10 months ago
- ChatTTS is a generative speech model for daily dialogue.this fork Support ollama☆44Updated 11 months ago
- Phi3 中文后训练模型仓库☆321Updated 5 months ago
- 基于Linly-Talker数字人改版的教育系统,包含网课总结、数字人对话、Chatbot对话,项目可在autodl部署☆27Updated 11 months ago
- This repo is to use chatTTS and Ollama to create local LLM audio tool.☆33Updated 10 months ago
- 🔥 Turn entire websites into LLM-ready markdown☆87Updated last year
- ☆57Updated 7 months ago
- LangChain Tutorial 2 实现 AI 女友 Demo☆51Updated last year