percent4 / multi-modal-image-searchView external linksLinks
本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。
☆28Feb 26, 2024Updated last year
Alternatives and similar repositories for multi-modal-image-search
Users that are interested in multi-modal-image-search are comparing it to the libraries listed below
Sorting:
- 通用数字人系统是一个基于深度学习和WebRTC技术的智能交互平台,集成了Azure Avatar数字人渲染、语音识别合成、自然语言处理等技术。系统支持实时对话、知识问答和情感交互,可实现30FPS以上的流畅渲染和200ms以内的低延迟响应。核心功能包括基于GPT的智能对话、…☆27Dec 17, 2025Updated last month
- learning project☆24Mar 27, 2024Updated last year
- Our 2nd-gen LMM☆34May 22, 2024Updated last year
- 本项目用于文档问答,使用向量嵌入 + ES 做召回,使用Rerank模型作为精排,再使用LLM做文档问答,Web框架使用Flask。☆33Mar 17, 2025Updated 10 months ago
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"☆10Feb 17, 2021Updated 4 years ago
- Automatic defect recognition in X-ray testing using computer vision☆12Dec 8, 2018Updated 7 years ago
- StreamlitとLangGraphで実装したHuman-in-the-loop広告コピー文生成アプリケーション☆11Feb 15, 2025Updated 11 months ago
- A Odata compliant Query Builder built using Dotnet Standard 2.0 for MongoDB, SQL, Azure Cosmos Db, In Memory database☆12Jan 6, 2023Updated 3 years ago
- ☆40Oct 17, 2024Updated last year
- ☆18Feb 16, 2025Updated 11 months ago
- ☆11Oct 31, 2024Updated last year
- ☆22Dec 11, 2025Updated 2 months ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆41May 3, 2025Updated 9 months ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 10 months ago
- GPT Table Semantic Parsing with complex & non-intuitive structure.☆17Jul 16, 2025Updated 6 months ago
- The official repository for paper "LLMaAA: Making Large Language Models as Active Annotators"☆44Apr 14, 2024Updated last year
- Code for TensorRT inference of LaneATT model☆44Apr 29, 2022Updated 3 years ago
- mobileNet SSD 基于caffe的前向检测☆10Nov 30, 2018Updated 7 years ago
- ☆11May 8, 2020Updated 5 years ago
- A vote and lottery App for Wechat☆13Dec 16, 2013Updated 12 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- Long Context Research☆26Jan 26, 2026Updated 2 weeks ago
- ☆10Apr 20, 2019Updated 6 years ago
- Code for paper "Targeted Sentiment Classification Based on Attentional Encoding and Graph Convolutional Networks"☆10Mar 8, 2020Updated 5 years ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Mar 9, 2018Updated 7 years ago
- [ECCV 2022] "TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information" by…☆10Sep 21, 2022Updated 3 years ago
- An open-source Agent Skill framework implementing progressive disclosure architecture☆40Jan 30, 2026Updated 2 weeks ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Oct 18, 2021Updated 4 years ago
- ☆21Jun 16, 2025Updated 7 months ago
- DE1SoC VGA and Audio☆10Jan 11, 2017Updated 9 years ago
- Improve your organization's efficiency with our innovative ERP solution! Built using C#, .NET Web API, Docker, RabbitMQ with MassTransit,…☆17Jul 31, 2024Updated last year
- BiLSTM+CRF☆10Jan 15, 2019Updated 7 years ago
- A Fast Image Converter thats supports common image formats. It's using WebAssembly for all conversions so no image is sent to the server…☆11Jul 10, 2025Updated 7 months ago
- ☆12Mar 1, 2023Updated 2 years ago
- ☆10Nov 28, 2022Updated 3 years ago
- Face++ 是一款基于 Android 平台开发的创新性 AI 面相分析应用。它巧妙地将中国传统面相学理论(如“三庭五眼”和“十二宫”)与现代人工智能技术相结合,为用户提供一份专业、详尽且富有洞察力的面相分析报告☆21Jul 14, 2025Updated 7 months ago
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆17Jun 19, 2025Updated 7 months ago
- ExtractChar could detect Chinese character with SWT and connectedComponentsWithStats.☆10Dec 24, 2023Updated 2 years ago
- Awesome Self-Supervised Vision Learning☆11Mar 27, 2024Updated last year