DAILtech / LLaVA-Deploy-GuideLinks
π» Tutorial for deploying LLaVA (Large Language & Vision Assistant) on Ubuntu + CUDA β step-by-step guide with CLI & web UI.
β16Updated 5 months ago
Alternatives and similar repositories for LLaVA-Deploy-Guide
Users that are interested in LLaVA-Deploy-Guide are comparing it to the libraries listed below
Sorting:
- β15Updated 8 months ago
- β21Updated 6 months ago
- [AAAI 2024] Prompt-based Distribution Alignment for Unsupervised Domain Adaptationβ73Updated last year
- β35Updated 6 months ago
- PMR: Prototypical Modal Rebalance for Multimodal Learningβ42Updated 2 years ago
- Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)β29Updated 7 months ago
- A curated list of balanced multimodal learning methods.β119Updated this week
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024β56Updated 11 months ago
- Code for dmrnetβ28Updated 3 months ago
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".β82Updated 5 months ago
- β33Updated 10 months ago
- AAAI-24 Decoupled Contrastive Learning for Long-Tailed Recognitionβ32Updated last year
- [ICML 2023] Provable Dynamic Fusion for Low-Quality Multimodal Dataβ111Updated 3 months ago
- Code for "Leveraging Knowledge of Modality Experts for Incomplete Multimodal Learning" accepted by ACM Multimedia 2024β32Updated 9 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024β46Updated last year
- Code for the paper 'Dynamic Multimodal Fusion'β117Updated 2 years ago
- Code for UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning (ACL 2023)β37Updated last year
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".β40Updated 9 months ago
- offical code for MMANet: Margin-aware Distillation and Modality-aware Regularization for Incomplete Multimodal Learningβ43Updated last year
- Code for the paper Visual Explanations of ImageβText Representations via Multi-Modal Information Bottleneck Attributionβ58Updated last year
- Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23β218Updated last year
- An official implementation of "Distribution-Consistent Modal Recovering for Incomplete Multimodal Learning" in PyTorch. (ICCV 2023)β31Updated 2 years ago
- The official GitHub page for the survey paper "CLIP-Powered Domain Generalization and Domain Adaptation: A Comprehensive Survey". And thiβ¦β53Updated 2 months ago
- The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)β287Updated 3 weeks ago
- An official implementation of "Incomplete Multimodality-Diffused Emotion Recognition" in PyTorch. (NeurIPS 2023)β57Updated last year
- [ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Modelsβ17Updated last year
- The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024β27Updated last year
- [ICML 2024] Official implementation for "Predictive Dynamic Fusion."β65Updated 9 months ago
- The official code repository of ShaSpec model from CVPR 2023 [paper](https://arxiv.org/pdf/2307.14126) "Multi-modal Learning with Missingβ¦β77Updated 6 months ago
- Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.β48Updated 3 months ago