DAILtech / LLaVA-Deploy-GuideLinks
π» Tutorial for deploying LLaVA (Large Language & Vision Assistant) on Ubuntu + CUDA β step-by-step guide with CLI & web UI.
β12Updated 3 months ago
Alternatives and similar repositories for LLaVA-Deploy-Guide
Users that are interested in LLaVA-Deploy-Guide are comparing it to the libraries listed below
Sorting:
- β13Updated 6 months ago
- β20Updated 4 months ago
- [ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Modelsβ17Updated 11 months ago
- [AAAI 2024] Prompt-based Distribution Alignment for Unsupervised Domain Adaptationβ71Updated 10 months ago
- ViT Grad-CAM Visualizationβ32Updated last year
- β31Updated 4 months ago
- PMR: Prototypical Modal Rebalance for Multimodal Learningβ39Updated 2 years ago
- β33Updated 8 months ago
- β13Updated 5 months ago
- This is the official implementation of "Fuzzy Multimodal Learning for Trusted Cross-modal Retrieval" (CVPR 2025)β20Updated 2 weeks ago
- β33Updated 3 weeks ago
- An official implementation of "Incomplete Multimodality-Diffused Emotion Recognition" in PyTorch. (NeurIPS 2023)β53Updated last year
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024β53Updated 9 months ago
- This is the official repository for the code and datasets in the paper "Progressive Open Space Expansion for Open-Set Model Attribution",β¦β24Updated last year
- [CVPR 2025] Implementation of "Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models"β28Updated 3 months ago
- The offical implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://arβ¦β42Updated 8 months ago
- Code for the paper Visual Explanations of ImageβText Representations via Multi-Modal Information Bottleneck Attributionβ55Updated last year
- An official implementation of "Distribution-Consistent Modal Recovering for Incomplete Multimodal Learning" in PyTorch. (ICCV 2023)β31Updated last year
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learningβ84Updated 3 weeks ago
- A list of awesome papers on AI-generated Image Detection.β44Updated 3 weeks ago
- C2P-CLIP-DeepfakeDetectionβ63Updated 4 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024β44Updated last year
- AAAI-24 Decoupled Contrastive Learning for Long-Tailed Recognitionβ32Updated last year
- A curated list of balanced multimodal learning methods.β94Updated last week
- A novel cross-modal decoupling and alignment framework for multimodal representation learning.β28Updated 4 months ago
- Code for UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning (ACL 2023)β36Updated last year
- [CVPR'25] EMOE: Modality-Specific Enhanced Dynamic Emotion Expertsβ47Updated last month
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".β75Updated 3 months ago
- This repository is the official implementation of StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Modelβ16Updated last year
- Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)β25Updated 5 months ago