本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。
☆28Feb 26, 2024Updated 2 years ago
Alternatives and similar repositories for multi-modal-image-search
Users that are interested in multi-modal-image-search are comparing it to the libraries listed below
Sorting:
- 通用数字人系统是一个基于深度学习和WebRTC技术的智能交互平台,集成了Azure Avatar数字人渲染、语音识别合成、自然语言处理等技术。系统支持实时对话、知识问答和情感交互,可实现30FPS以上的流畅渲染和200ms以内的低延迟响应。核心功能包括基于GPT的智能对话、…☆28Dec 17, 2025Updated 2 months ago
- 大模型训练、推理、推荐系统相关☆30Nov 30, 2025Updated 3 months ago
- Our 2nd-gen LMM☆34May 22, 2024Updated last year
- A Odata compliant Query Builder built using Dotnet Standard 2.0 for MongoDB, SQL, Azure Cosmos Db, In Memory database☆12Jan 6, 2023Updated 3 years ago
- StreamlitとLangGraphで実装したHuman-in-the-loop広告コピー文生成アプリケーション☆11Feb 15, 2025Updated last year
- Automatic defect recognition in X-ray testing using computer vision☆12Dec 8, 2018Updated 7 years ago
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"☆10Feb 17, 2021Updated 5 years ago
- ☆40Oct 17, 2024Updated last year
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆42Dec 28, 2024Updated last year
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆42May 3, 2025Updated 10 months ago
- ☆11Oct 31, 2024Updated last year
- ☆22Dec 11, 2025Updated 2 months ago
- ☆18Feb 16, 2025Updated last year
- 增加了indextts2的简单的界面与api调用方式☆21Oct 27, 2025Updated 4 months ago
- The official repository for paper "LLMaAA: Making Large Language Models as Active Annotators"☆44Apr 14, 2024Updated last year
- Code for TensorRT inference of LaneATT model☆44Apr 29, 2022Updated 3 years ago
- BiLSTM+CRF☆10Jan 15, 2019Updated 7 years ago
- [ECCV 2022] "TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information" by…☆10Sep 21, 2022Updated 3 years ago
- ESP32 web server displaying frames from a 'OV7670+AL422 FIFO' camera module☆12Dec 6, 2017Updated 8 years ago
- ☆10Jul 30, 2023Updated 2 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- ☆21Jun 16, 2025Updated 8 months ago
- DE1SoC VGA and Audio☆10Jan 11, 2017Updated 9 years ago
- mobileNet SSD 基于caffe的前向检测☆10Nov 30, 2018Updated 7 years ago
- Code for paper "Targeted Sentiment Classification Based on Attentional Encoding and Graph Convolutional Networks"☆10Mar 8, 2020Updated 5 years ago
- A Fast Image Converter thats supports common image formats. It's using WebAssembly for all conversions so no image is sent to the server…☆11Jul 10, 2025Updated 7 months ago
- ☆12Mar 1, 2023Updated 3 years ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Oct 18, 2021Updated 4 years ago
- ☆11May 8, 2020Updated 5 years ago
- ppt转数字人后台☆18Apr 9, 2025Updated 10 months ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Mar 9, 2018Updated 7 years ago
- 团队协作沟通软件,打算做成开源免费的钉钉/飞书,预期使用5年时间完成这个巨大的目标。☆11May 29, 2024Updated last year
- A vote and lottery App for Wechat☆13Dec 16, 2013Updated 12 years ago
- Long Context Research☆29Jan 26, 2026Updated last month
- Superpixel segmentation using SLIC and Felzenszwalb.☆12Apr 6, 2021Updated 4 years ago
- ☆14Dec 3, 2025Updated 3 months ago
- 在index-tts-vllm的基础上,实现了并提供了模拟流式合成音频的接口服务及客户端测试脚本☆27Sep 2, 2025Updated 6 months ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆18Mar 7, 2025Updated last year
- cv-warpPolar-exampleは、OpenCVでの極座標変換/逆変換の実行例です。☆11Jul 11, 2020Updated 5 years ago