mapluisch/LLaVA-CLI-with-multiple-images

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mapluisch/LLaVA-CLI-with-multiple-images)

mapluisch / LLaVA-CLI-with-multiple-images

LLaVA inference with multiple images at once for cross-image analysis.

☆51

Alternatives and similar repositories for LLaVA-CLI-with-multiple-images

Users that are interested in LLaVA-CLI-with-multiple-images are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tang-xiaoxiao / 3D-RAD
View on GitHub
[ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks
☆30Oct 28, 2025Updated 6 months ago
dosowiechi / MLMP
View on GitHub
Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation
☆28Sep 20, 2025Updated 7 months ago
HKUST-LongGroup / DyME
View on GitHub
[ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration
☆16Mar 18, 2026Updated last month
lalithjets / SurgicalGPT
View on GitHub
☆28Feb 7, 2024Updated 2 years ago
openvino-dev-samples / decode-infer-on-GPU
View on GitHub
This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…
☆15Jun 15, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
luka-group / CoIN
View on GitHub
☆13Jun 11, 2024Updated last year
sammysun0711 / ov_llm_bench
View on GitHub
OpenVINO LLM Benchmark
☆11Dec 7, 2023Updated 2 years ago
intel-iot-devkit / ffmpeg-VAAPI-OpenVINO
View on GitHub
Demo on iGPU for FFmpeg decode and scale, OpenVINO inference. this is zero-copy solution, which means No frame data copy from CPU to iGPU…
☆17Jan 25, 2023Updated 3 years ago
Anttwo / MACARONS
View on GitHub
(CVPR 2023) Official code of MACARONS: Mapping And Coverage Anticipation with RGB ONline Self-supervision. Also contains an updated and i…
☆85Dec 23, 2023Updated 2 years ago
AlonMendelson / SGVL
View on GitHub
☆17Dec 13, 2023Updated 2 years ago
ZhangCYG / DDFHO
View on GitHub
☆18May 13, 2024Updated last year
forwchen / LLaVA-MoLE
View on GitHub
☆10Mar 4, 2024Updated 2 years ago
facebookresearch / threadweaver
View on GitHub
The implementation for ThreadWeaver Adaptive Threading for Efficient Parallel Reasoning in Language Models
☆56Apr 8, 2026Updated 3 weeks ago
allenai / gpv2-web10k
View on GitHub
Download Web-10K data by querying Bing Image Search
☆10Feb 1, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
HCPLab-SYSU / STKET
View on GitHub
Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)
☆20Mar 13, 2024Updated 2 years ago
zcai0612 / InstantBooth
View on GitHub
My implement of InstantBooth
☆13Sep 11, 2023Updated 2 years ago
yeachan-kr / pytorch-coteaching
View on GitHub
Pytorch implementations of Co-teaching for noisy label learning
☆13Jun 28, 2022Updated 3 years ago
Chenfeng1271 / SVDiff
View on GitHub
Streaming Video Diffusion: Online Video Editing with Diffusion Models
☆18Jun 3, 2024Updated last year
slyalin / openvino_devtools
View on GitHub
Tools for easier OpenVINO development/debugging
☆10Jul 16, 2025Updated 9 months ago
xiaofei05 / TSST
View on GitHub
Code for EMNLP2021 paper “Transductive Learning for Unsupervised Text Style Transfer”
☆12Sep 19, 2021Updated 4 years ago
mhsung / libigl-renderer
View on GitHub
☆19Mar 14, 2023Updated 3 years ago
UMass-Embodied-AGI / CoVLM
View on GitHub
[ICLR 2023] CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
☆46Jun 9, 2025Updated 10 months ago
microsoft / Do-You-See-Me
View on GitHub
☆11Jun 21, 2025Updated 10 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
RUC-NLPIR / SearchClaw
View on GitHub
Harness for deep search agent
☆85Updated this week
Pandapan01 / EvSSC
View on GitHub
☆11Apr 7, 2026Updated 3 weeks ago
Lackel / AGLA
View on GitHub
[CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
☆67Jul 16, 2024Updated last year
ducha-aiki / vs3-cnn-labs
View on GitHub
Computer vision labs for Vision and Sports Summer School 2022
☆10Jul 29, 2022Updated 3 years ago
showlab / mist
View on GitHub
☆37Dec 20, 2023Updated 2 years ago
PLAN-Lab / CheXRelFormer
View on GitHub
Hierarchical Vision Transformers for Disease Progression Detection in Chest X-Ray Images
☆11Jan 11, 2024Updated 2 years ago
ZackZikaiXiao / FedGraB
View on GitHub
☆16Mar 6, 2024Updated 2 years ago
jenkspt / NeuS
View on GitHub
NeuS adapted to use multires hash encoding
☆60Aug 18, 2022Updated 3 years ago
cdancette / detect-shortcuts
View on GitHub
Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering
☆29Jul 1, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
GeWu-Lab / MokA
View on GitHub
MokA: Multimodal Low-Rank Adaptation for MLLMs
☆88Dec 30, 2025Updated 4 months ago
tobibaum / metrabs_trt
View on GitHub
Running 3D HPE at 30fps
☆15Jan 3, 2022Updated 4 years ago
ahmdtaha / distributed_sigmoid_loss
View on GitHub
Unofficial implementation for Sigmoid Loss for Language Image Pre-Training
☆11Sep 26, 2023Updated 2 years ago
EliSchwartz / MAEDAY
View on GitHub
☆12Nov 22, 2022Updated 3 years ago
THU-MIG / VTC-CLS
View on GitHub
official repo for paper "[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs"
☆22Apr 23, 2025Updated last year
phiyodr / vqaloader
View on GitHub
PyTorch DataLoader for many VQA datasets
☆15Jan 10, 2023Updated 3 years ago
steb6 / ISBFSAR
View on GitHub
Interactive Skeleton Based Few Shot Action Recognition
☆14Nov 8, 2022Updated 3 years ago