This project uses the LLaVA 1.6 multimodal model to implement text-to-image and image-to-image search.
☆28 · Feb 26, 2024 · Updated 2 years ago
Alternatives and similar repositories for multi-modal-image-search
Users who are interested in multi-modal-image-search are comparing it to the libraries listed below.
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference ☆19 · Jun 19, 2025 · Updated 9 months ago
- ☆10 · Jul 30, 2023 · Updated 2 years ago
- ☆12 · Oct 4, 2021 · Updated 4 years ago
- UniG-Encoder: A Universal Feature Encoder for Graph and Hypergraph Node Classification. ☆13 · Jul 18, 2025 · Updated 8 months ago
- Image Segmentation using k-means, n-cuts and superpixels ☆11 · Mar 31, 2019 · Updated 7 years ago
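- In RAG, generating and matching embedding vectors is a key step. This article introduces an embedding service based on CLIP/BLIP models that supports embedding generation and similarity computation for both text and images, providing a foundation for multimodal information retrieval. ☆42 · Dec 28, 2024 · Updated last year
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization ☆18 · Mar 7, 2025 · Updated last year
- A document question-answering project: vector embeddings plus ES handle recall, a rerank model handles fine-grained ranking, and an LLM then answers questions over the documents. The web framework is Flask. ☆34 · Mar 17, 2025 · Updated last year
- Superpixel segmentation using SLIC and Felzenszwalb. ☆13 · Apr 6, 2021 · Updated 5 years ago
- An intelligent dialogue system based on ChatGLM3-6b that integrates RAG, knowledge graphs, agents, multimodality, and other techniques to improve the quality of the large model's responses. ☆66 · Aug 12, 2024 · Updated last year
- ☆16 · May 11, 2024 · Updated last year
- A repository for pretraining from scratch and then SFT-ing a small-parameter Chinese LLaMa2; a single 24G GPU is enough to run it and obtain a chat-llama2 with basic Chinese question-answering ability. ☆17 · Feb 29, 2024 · Updated 2 years ago
- ☆12 · Jul 7, 2024 · Updated last year
- Node classification using graph convolutional networks ☆11 · Mar 23, 2020 · Updated 6 years ago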
- ☆31 · Dec 8, 2025 · Updated 4 months ago
- Multi-stage convolutional autoencoder network for hyperspectral unmixing ☆15 · Jun 7, 2024 · Updated last year
- Python3 library for common unmixing functions ☆15 · Oct 2, 2018 · Updated 7 years ago
- Source code for the paper Residual Enhanced Multi-Hypergraph Neural Network (ICIP 2021). ☆19 · Jul 8, 2021 · Updated 4 years ago
- [NeurIPS 2025] Official Implementation of ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding. ☆51 · Jan 28, 2026 · Updated 2 months ago
- Our 2nd-gen LMM ☆34 · May 22, 2024 · Updated last year
- Codes for Paper: From Hypergraph Energy Functions to Hypergraph Neural Networks ☆23 · Jun 29, 2023 · Updated 2 years ago
- [Recsys'2023] "RCL: Multi-Relational Contrastive Learning for Recommendation" ☆16 · Sep 6, 2023 · Updated 2 years ago
- ☆12 · Aug 3, 2024 · Updated last year
- CTE: Contextualized Table Extraction Dataset ☆17 · Feb 23, 2023 · Updated 3 years ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc. ☆12 · Mar 15, 2024 · Updated 2 years ago
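- Uses the pretrained BERT language model to obtain vector representations of review text, learns their semantic features with a Bi-GRU network, assigns weights to the review vectors using sentiment weights and an attention mechanism respectively to dynamically adjust their influence on user features and product features, obtains the user and product features by weighted summation, and finally applies the DeepFM algorithm to the user and product features for deep… ☆16 · Mar 28, 2023 · Updated 3 years ago
- Built on index-tts-vllm; implements and provides an API service for simulated streaming audio synthesis, along with a client test script. ☆26 · Sep 2, 2025 · Updated 7 months ago
- ☆11 · May 8, 2020 · Updated 5 years ago
- A Chinese dataset distilled from the full-strength DeepSeek-R1 ☆64 · Feb 21, 2025 · Updated last year
- Graduation project (a parking lot fee system based on OpenCV license plate recognition) ☆12 · Jul 16, 2022 · Updated 3 years ago
- Code for the paper "Rotation Equivariant Graph Convolutional Network for Spherical Image Classification", CVPR 2020 ☆21 · Aug 12, 2020 · Updated 5 years ago
- An AIGC application integrating LLM and SDXL ☆29 · Jan 3, 2024 · Updated 2 years ago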
- Adds a simple UI and an API calling method for indextts2 ☆27 · Oct 27, 2025 · Updated 5 months ago
- Codes for ACL 2023 paper: Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling. ☆20 · Jun 25, 2024 · Updated last year
- [SEKE 2023] This is the implementation code of Heterogeneous Directed Hypergraph Neural Network (HDHGN) model in the paper Heterogeneous … ☆23 · Aug 23, 2025 · Updated 7 months ago
- Matlab code of the TGRS paper entitled "Sparsity-Enhanced Convolutional Decomposition: A Novel Tensor-Based Paradigm for Blind Hyperspect… ☆20 · Apr 10, 2021 · Updated 5 years ago
- PyTorch implementation of "A Simple Baseline for Low-Budget Active Learning". ☆14 · Dec 22, 2021 · Updated 4 years ago
- An AI-driven virtual digital human livestreaming system that supports modular development of 2D/3D digital humans, TTS, ASR, lip sync, stream pushing, interaction, and more. ☆23 · May 13, 2025 · Updated 11 months ago
- This is the official implementation of LOPR used in "LOPR: Latent Occupancy PRediction using Generative Models" ☆19 · Aug 21, 2023 · Updated 2 years ago