percent4/multi-modal-image-search

percent4 / multi-modal-image-search

This project uses the LLaVA 1.6 multimodal model to implement text-to-image and image-to-image search.

☆28

Alternatives and similar repositories for multi-modal-image-search

Users who are interested in multi-modal-image-search are comparing it to the libraries listed below.

sophgo / sophon-sail
Guide to deploying deep-learning inference networks and deep vision primitives on SOPHON TPU.
☆20 • Jul 2, 2026 • Updated 3 weeks ago
NickLucche / image_segmentation
Image Segmentation using k-means, n-cuts and superpixels
☆11 • Mar 31, 2019 • Updated 7 years ago
Form2Seq-Data / Dataset
Dataset corresponding to the paper: "Form2Seq: A Framework for Higher-Order Form Structure Extraction"
☆10 • Feb 17, 2021 • Updated 5 years ago
qhgz2013 / HyperDNE
☆10 • Jul 30, 2023 • Updated 2 years ago
mvijaikumar / HyperTeNet
☆12 • Oct 4, 2021 • Updated 4 years ago
ant-research / M2-Miner
[ICLR 2026] M2-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining
☆55 • Apr 22, 2026 • Updated 3 months ago
percent4 / llm_4_doc_qa
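This project implements document question answering: vector embeddings plus ES handle recall, a Rerank model handles fine-grained ranking, an LLM then answers questions over the documents, and Flask serves as the web framework.
☆34 • Mar 17, 2025 • Updated last year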
BlarkLee / MonoPLFlowNet
ECCV 2022, MonoPLFlowNet
☆10 • Jun 14, 2024 • Updated 2 years ago
copawloroous / SGMAE
[TGRS 2025] Self-Supervised Graph Masked Autoencoders for Hyperspectral Image Classification
☆15 • Jun 2, 2026 • Updated last month
forwchen / LLaVA-MoLE
☆10 • Mar 4, 2024 • Updated 2 years ago
jianzhu / dl-rerank
☆11 • May 8, 2020 • Updated 6 years ago
mahmad00 / Conventional-to-Transformer-for-Hyperspectral-Image-Classification-Survey-2024
☆16 • May 11, 2024 • Updated 2 years ago
yuyang95 / JAG-MSNet
Multi-stage convolutional autoencoder network for hyperspectral unmixing
☆16 • Jun 7, 2024 • Updated 2 years ago
etienne-monier / lib-unmixing
Python3 library for common unmixing functions
☆15 • Oct 2, 2018 • Updated 7 years ago
OneForward / ResMHGNN
Source code for the paper Residual Enhanced Multi-Hypergraph Neural Network (ICIP 2021).
☆19 • Jul 8, 2021 • Updated 5 years ago
HKUDS / RCL
[Recsys'2023] "RCL: Multi-Relational Contrastive Learning for Recommendation"
☆16 • Sep 6, 2023 • Updated 2 years ago
Lhai0704 / social_network_simulation
LLM-driven social network simulation system
☆10 • Oct 13, 2024 • Updated last year
qichaoliu / HSI-CNCMN
Q. Liu, L. Xiao, N. Huang and J. Tang, "Composite Neighbor-Aware Convolutional Metric Networks for Hyperspectral Image Classification," i…
☆14 • Sep 5, 2023 • Updated 2 years ago
360CVGroup / 360VL
Our 2nd-gen LMM
☆34 • May 22, 2024 • Updated 2 years ago
AIoT-MLSys-Lab / MEDA
[NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
☆22 • Jun 19, 2025 • Updated last year
yamato0811 / streamlit-langgraph-HITL-copy-generator
Human-in-the-loop advertising copy generation application implemented with Streamlit and LangGraph
☆11 • Feb 15, 2025 • Updated last year
WuXinglong-HIT / CLIPER
☆12 • Jul 7, 2024 • Updated 2 years ago
JSJeong-me / GPT-Table
GPT Table Semantic Parsing with complex & non-intuitive structure.
☆17 • Jul 16, 2025 • Updated last year
fufankeji / PaddleOCR-MultiRAG
Traceable multimodal RAG system powered by PaddleOCR-VL; every answer has its source
☆27 • Oct 29, 2025 • Updated 9 months ago
Bklight999 / world-knowledge
A novel self-evolving paradigm, without task, reward, or complex workflow
☆36 • May 12, 2026 • Updated 2 months ago
hubo0417 / EasyGC
An AIGC application that integrates an LLM with SDXL
☆29 • Jan 3, 2024 • Updated 2 years ago
danfenghong / IEEE_TGRS_SeCoDe
Matlab code of the TGRS paper entitled "Sparsity-Enhanced Convolutional Decomposition: A Novel Tensor-Based Paradigm for Blind Hyperspect…
☆20 • Apr 10, 2021 • Updated 5 years ago
BehnoodRasti / MiSiCNet
MiSiCNet: Minimum Simplex Convolutional Network for Deep Hyperspectral Unmixing
☆19 • Feb 8, 2022 • Updated 4 years ago
yqtian-se / EvalDNN
EvalDNN: A Toolbox for Evaluating Deep Neural Network Models
☆14 • Mar 9, 2020 • Updated 6 years ago
UCDvision / low-budget-al
PyTorch implementation of "A Simple Baseline for Low-Budget Active Learning".
☆14 • Dec 22, 2021 • Updated 4 years ago
Tony-Hu-yh / review-based-recommendation
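Uses the BERT pre-trained language model to obtain vector representations of review text, learns their semantic features with a Bi-GRU network, assigns weights to the review vectors using sentiment weights and an attention mechanism respectively to dynamically adjust their influence on user features and product features, obtains the user and product features by weighted summation, and finally applies the DeepFM algorithm to the user and product features for deep…
☆16 • Mar 28, 2023 • Updated 3 years ago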
sisl / LOPR
This is the official implementation of LOPR used in "LOPR: Latent Occupancy PRediction using Generative Models"
☆19 • Aug 21, 2023 • Updated 2 years ago
nzjin / awesome_moe
A collection of MoE (Mixture of Experts) papers, code, tools, etc.
☆12 • Mar 15, 2024 • Updated 2 years ago
shubham99bisht / Expense-Tracker
Key Information Extraction from Scanned Receipts: The aim of this project is to extract texts of a number of key fields from given receip…
☆17 • Oct 12, 2021 • Updated 4 years ago
DeepReasoning / NeuLR
Content-neutral dataset of logical reasoning
☆20 • Mar 21, 2025 • Updated last year
SCZwangxiao / video-ReTaKe
Official implementation of the paper "ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding"
☆40 • Mar 16, 2025 • Updated last year
ilias-ant / hyperspectral-images
Processing of HSIs: spectral unmixing and classification.
☆21 • Apr 2, 2021 • Updated 5 years ago
hamza08003 / DRL-based-UAV-Path-Planning
This repository contains source code for Multi-UAV Task Assignment and Path Planning
☆18 • Dec 5, 2025 • Updated 7 months ago
Chen-GX / C-3PO
[ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…
☆44 • May 3, 2025 • Updated last year