DAILtech / LLaVA-Deploy-GuideLinks
💻 Tutorial for deploying LLaVA (Large Language & Vision Assistant) on Ubuntu + CUDA – step-by-step guide with CLI & web UI.
☆15Updated 4 months ago
Alternatives and similar repositories for LLaVA-Deploy-Guide
Users that are interested in LLaVA-Deploy-Guide are comparing it to the libraries listed below
Sorting:
- [AAAI 2024] Prompt-based Distribution Alignment for Unsupervised Domain Adaptation☆73Updated 11 months ago
- ☆15Updated 8 months ago
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆40Updated 2 years ago
- A curated list of balanced multimodal learning methods.☆104Updated this week
- ☆35Updated 5 months ago
- Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)☆29Updated 6 months ago
- The official GitHub page for the survey paper "CLIP-Powered Domain Generalization and Domain Adaptation: A Comprehensive Survey". And thi…☆52Updated last month
- Code for "Leveraging Knowledge of Modality Experts for Incomplete Multimodal Learning" accepted by ACM Multimedia 2024☆32Updated 8 months ago
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆54Updated 10 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆45Updated last year
- [ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models☆17Updated last year
- Code for UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning (ACL 2023)☆36Updated last year
- Probabilistic Contrastive Learning for Domain Adaptation☆14Updated last year
- The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)☆285Updated this week
- ☆32Updated 10 months ago
- [ICML 2024] Official implementation for "Predictive Dynamic Fusion."☆63Updated 8 months ago
- An official implementation of "Incomplete Multimodality-Diffused Emotion Recognition" in PyTorch. (NeurIPS 2023)☆55Updated last year
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆82Updated 5 months ago
- AAAI-24 Decoupled Contrastive Learning for Long-Tailed Recognition☆32Updated last year
- MAE for CIFAR,由于可用资源有限,我们仅在 cifar10 上测试模型。我们主要想重现这样的结果:使用 MAE 预训练 ViT 可以比直接使用标签进行监督学习训练获得更好的结果。这应该是自我监督学习比监督学习更有效的数据的证据。☆78Updated 2 years ago
- This repository is a collection of awesome things about vision prompts, including papers, code, etc.☆36Updated last year
- Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.☆48Updated 2 months ago
- ICCV 2023☆11Updated last year
- An official implementation of "Distribution-Consistent Modal Recovering for Incomplete Multimodal Learning" in PyTorch. (ICCV 2023)☆31Updated last year
- The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024☆27Updated last year
- Source code for the paper "Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts" (ICML 2024)☆89Updated 11 months ago
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".☆40Updated 9 months ago
- The official code repository of ShaSpec model from CVPR 2023 [paper](https://arxiv.org/pdf/2307.14126) "Multi-modal Learning with Missing…☆76Updated 5 months ago
- This is a repository for organizing papers ,codes, and etc related to Domain Generalization☆24Updated 2 years ago
- DPStyler: Dynamic PromptStyler for Source-Free Domain Generalization☆13Updated last year