li-xiu-qi / SmartlmageFinder
An image search engine and management system built on multimodal embedding models and visual multimodal models, supporting precise text-to-text, text-to-image, and image-to-image retrieval.
☆50 Updated this week
Alternatives and similar repositories for SmartlmageFinder
Users that are interested in SmartlmageFinder are comparing it to the libraries listed below
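- Welcome to the hands-on project repository of 筱可AI研习社! This repository stores and showcases the hands-on projects written for the official account. We will keep optimizing and iterating on these projects to explore the limitless possibilities of AI. ☆69 Updated last week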
- ☆255 Updated 7 months ago
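- "AI-Compass" guides the community through the sea of AI technology. Whether you are a beginner or an advanced developer, you can find paths to every major area of AI here. It aims to help developers systematically understand AI's core concepts, mainstream techniques, and emerging trends, and to master the full process from theory to deployment through practice. ☆87 Updated last week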
- Converts files into Markdown to help RAG pipelines or LLMs understand them, based on markitdown and MinerU, which provide high-quality PDF parsing. ☆118 Updated 4 months ago
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬 ☆60 Updated 4 months ago
- Video understanding: Qwen multimodal video model & Dify ☆62 Updated 11 months ago
- gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, and TTS models. ☆202 Updated last week
- A mini assistant to help you read papers quickly ☆50 Updated 2 months ago
- microsoft/graphrag with Chinese-language support 🇨🇳🇨🇳🇨🇳 ☆50 Updated 3 months ago
- The OCR component of ragflow; an unofficial project ☆45 Updated 11 months ago
- Deploying Qwen2.5 with FastAPI + vLLM ☆21 Updated 10 months ago
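- Fine-tunes the internlm2 model on data such as historical coal mine accident cases, accident handling reports, safety regulations and rules, technical documents, and the onboarding exam question bank for coal mine workers, to provide intelligent question answering on coal mine accidents and coal mine safety knowledge. ☆50 Updated 6 months ago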
- Python3 package for Chinese/English OCR, using the paddleocr-v5 ONNX model (~20MB), with ultra-fast inference speed. Inference based on the ppocr-v5-onnx model; open-source Chinese/English OCR… ☆94 Updated 2 weeks ago
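- In RAG, generating and matching embedding vectors is a key step. This article introduces an embedding service based on CLIP/BLIP models that supports embedding generation and similarity computation for both text and images, providing a foundation for multimodal information retrieval. ☆32 Updated 7 months ago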
- llamafactory blog ☆32 Updated 9 months ago
- A tool for creating pre-training datasets for language models, supporting one-click batch processing for both text and image datasets. A… ☆35 Updated 7 months ago
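- The llava-Qwen2-7B-Instruct-Chinese-CLIP model improves Chinese text recognition and the ability to interpret the implied meaning of memes, approaching the recognition level of gpt4o and claude-3.5-sonnet! ☆24 Updated last year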
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or… ☆58 Updated 2 months ago
- [ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework ☆167 Updated last month
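- Compares LightRAG and GraphRAG during index construction and retrieval testing, in terms of elapsed time, number of model requests, token cost, retrieval quality, and more. ☆110 Updated 8 months ago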
- KnowFlowRAG ☆165 Updated this week
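- An application example of GraphRAG. The project provides a way to replace the OpenAI models and, by modifying the original prompts and the document-splitting method, improves GraphRAG's handling of Chinese content. ☆166 Updated 9 months ago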
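- Using the PaddlePaddle (飞桨) platform, this project builds an innovative multi-model collaborative system for efficient and accurate conversion of PDF files to Markdown files. ☆21 Updated 4 months ago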
- ✨🦋 illufly - 【幻蝶】 A self-evolving agent based on memory distillation and document retrieval ☆69 Updated last month
- A demo of a PDF parser (including OCR and object detection tools) ☆35 Updated 9 months ago
- ☆48 Updated 4 months ago
- dify's rag patch module ☆262 Updated last month
- Chat2Graph: Graph Native Agentic System. ☆314 Updated 2 weeks ago
- An LLM RAG (retrieval-augmented generation) system that runs on your laptop. It is easy to deploy and provides intelligent question answering over a local knowledge base. ☆257 Updated last week
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! ☆194 Updated last week