li-xiu-qi / SmartlmagerLinks
一个基于多模态向量模型及视觉多模态模型构建的图片搜索引擎&管理系统,实现精准的以文搜文,文搜图、以图搜图多种智能检索方式。An image search engine management system built upon multimodal vector models and visual multimodal models, implementing multiple intelligent search methods including precise text-to-text, text-to-image, and image-to-image retrieval.
☆58Updated 2 weeks ago
Alternatives and similar repositories for Smartlmager
Users that are interested in Smartlmager are comparing it to the libraries listed below
Sorting:
- “筱可AI研习社”的工程实验仓库!☆80Updated last week
- 本项目借助飞桨平台,构建起一套创新的多模型协同系统,实现 PDF 文件到 Markdown 文件的高效、精准转换。☆25Updated 5 months ago
- 视频理解:千问视频多模态模型 & Dify☆64Updated last year
- A mini assistant to help you read paper quickly☆53Updated 4 months ago
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬☆61Updated 5 months ago
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆59Updated 3 months ago
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆126Updated 5 months ago
- 使用FastAPI+vLLM部署Qwen2.5☆22Updated 11 months ago
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。☆208Updated this week
- 一个面向多模态大模型训练的智能数据集构建与评估平台☆117Updated last week
- 全方位大模型评测知识库 | 提示词工程(Prompt Engineer)、各渠道大模型榜单(LeaderBoard)、标杆数据集、安全检测、对抗攻击、智能体、优质数据、文本分类、关系抽取、语音识别、语音合成、多模态、文本生成图片、文本生成视频、点云、智能对话、摘要总结、问答…☆70Updated 9 months ago
- An AI-powered content conversion tool that transforms text, web content, or HTML code into beautifully designed card images.一款基于AI的内容转换工…☆29Updated last month
- 使用煤矿历史事故案例,事故处理报告、安全规程规章制度、技术文档、煤矿从业人员入职考试题库等数据,微调internlm2模型实现针对煤矿事故和煤矿安全知识的智能问答。☆51Updated 8 months ago
- [ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework☆172Updated this week
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆182Updated 9 months ago
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。☆203Updated 3 weeks ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆24Updated last year
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆219Updated 2 months ago
- ragflow中的ocr部分,非官方项目☆48Updated last year
- Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源…☆100Updated last month
- ☆265Updated 8 months ago
- 支持中文🇨🇳🇨🇳🇨🇳 的 microsoft/graphrag☆51Updated 5 months ago
- 一个用于BiliBili网站实时热点&舆情分析的AI 智能体☆78Updated 9 months ago
- ☆52Updated 6 months ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆534Updated 2 months ago
- 基于LangGraph开发的智能体项目,可借助大模型自动调用工具规划旅游行程,包括景点搜索、交通查询、饭店酒店查询等功能☆19Updated last year
- ☆141Updated 6 months ago
- ✨🦋 illufly - 【幻蝶】基于记忆蒸馏、资料检索的自我进化智能体☆69Updated 3 months ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud(通义点金:面向金融行业的大模型)☆321Updated 2 weeks ago
- generate ppt with llm☆101Updated last year