li-xiu-qi / SmartlmagerLinks
一个基于多模态向量模型及视觉多模态模型构建的图片搜索引擎&管理系统,实现精准的以文搜文,文搜图、以图搜图多种智能检索方式。An image search engine management system built upon multimodal vector models and visual multimodal models, implementing multiple intelligent search methods including precise text-to-text, text-to-image, and image-to-image retrieval.
☆68Updated last month
Alternatives and similar repositories for Smartlmager
Users that are interested in Smartlmager are comparing it to the libraries listed below
Sorting:
- 筱可的工程实验仓库!☆96Updated last week
- 本项目借助飞桨平台,构建起一套创新的多模型协同系统,实现 PDF 文件到 Markdown 文件的高效、精准转换。☆27Updated 7 months ago
- A mini assistant to help you read paper quickly☆54Updated 6 months ago
- ragflow中的ocr部分,非官方项目☆51Updated last year
- 视频理解:千问视频多模态模型 & Dify☆65Updated last year
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬☆63Updated 7 months ago
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆218Updated last week
- 模版式PPT,可以生成套用模版的PPT☆203Updated 2 weeks ago
- 使用煤矿历史事故案例,事故处理报告、安全规程规章制度、技术文档、煤矿从业人员入职考试题库等数据,微调internlm2模型实现针对煤矿事故和煤矿安全知识的智能问答。☆54Updated 10 months ago
- 支持中文🇨🇳🇨🇳🇨🇳 的 microsoft/graphrag☆51Updated 7 months ago
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆60Updated 5 months ago
- 一个面向多模态大模型训练的智能数据集构建与评估平台☆136Updated last month
- ☆152Updated 8 months ago
- 使用FastAPI+vLLM部署Qwen2.5☆24Updated last year
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆129Updated 7 months ago
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。☆217Updated 2 weeks ago
- [ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework☆179Updated last week
- ☆54Updated 8 months ago
- An AI-powered content conversion tool that transforms text, web content, or HTML code into beautifully designed card images.一款基于AI的内容转换工…☆32Updated 3 months ago
- ☆30Updated last month
- 全方位大模型评测知识库 | 提示词工程(Prompt Engineer)、各渠道大模型榜单(LeaderBoard)、标杆数据集、安全检测、对抗攻击、智能体、优质数据、文本分类、关系抽取、语音识别、语音合成、多模态、文本生成图片、文本生成视频、点云、智能对话、摘要总结、问答…☆74Updated 11 months ago
- Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源…☆111Updated 3 months ago
- 一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR☆193Updated 2 months ago
- ☆272Updated 10 months ago
- generate ppt with llm☆102Updated last year
- 基于LangGraph开发的智能体项目,可借助大模型自动调用工具规划旅游行程,包括景点搜索、交通查询、饭店酒店查询等功能☆31Updated last year
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆185Updated 11 months ago
- 一个用于BiliBili网站实时热点&舆情分析的AI 智能体☆82Updated 11 months ago
- Here is a demo for PDF parser (Including OCR, object detection tools)☆36Updated last year
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆26Updated last year