li-xiu-qi / SmartlmageFinder
一个基于多模态向量模型及视觉多模态模型构建的图片搜索引擎&管理系统,实现精准的以文搜文,文搜图、以图搜图多种智能检索方式。An image search engine management system built upon multimodal vector models and visual multimodal models, implementing multiple intelligent search methods including precise text-to-text, text-to-image, and image-to-image retrieval.
☆11Updated this week
Alternatives and similar repositories for SmartlmageFinder:
Users that are interested in SmartlmageFinder are comparing it to the libraries listed below
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- 使用FastAPI+vLLM部署Qwen2.5☆14Updated 7 months ago
- 中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.☆11Updated 8 months ago
- Music large model based on InternLM2-chat.☆22Updated 4 months ago
- Xtuner Factory☆33Updated last year
- ☆18Updated 10 months ago
- An AI-powered content conversion tool that transforms text, web content, or HTML code into beautifully designed card images.一款基于AI的内容转换工…☆17Updated 3 weeks ago
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆50Updated this week
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆20Updated 3 months ago
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆52Updated 3 months ago
- MLLM @ Game☆13Updated last month
- paper-read-notes☆11Updated 7 months ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated 7 months ago
- ☆28Updated 11 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 3 months ago
- HunyuanDiT with TensorRT and libtorch☆17Updated 11 months ago
- EfficientSAM + YOLO World base model for use with Autodistill.☆10Updated last year
- Here is a demo for PDF parser (Including OCR, object detection tools)☆34Updated 6 months ago
- 💡💡💡awesome compute vision app in gradio☆52Updated 11 months ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud☆50Updated 2 weeks ago
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆13Updated last year
- LLM RAG 应用,支持 API 调用,语音交互。☆11Updated 10 months ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆17Updated 7 months ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆13Updated 2 months ago
- 从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两层MLP投影层连…☆12Updated 2 months ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆123Updated 6 months ago
- LLM Tokenizer with BPE algorithm☆31Updated last year
- A mini assistant to help you read paper quickly☆40Updated 2 weeks ago
- ⛏️This is the storage of my Slides、Reports and Papers. | 存储PPT、报告和论文☆11Updated 6 months ago
- 补充了一些Visualglm缺 少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Updated last year