☆48May 24, 2023Updated 2 years ago
Alternatives and similar repositories for LLaVA
Users that are interested in LLaVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Focal Transformer for Boundary-aware Prostate Segmentation using CT Images☆11Nov 10, 2024Updated last year
- [RSS 23] Dynamic-Resolution Model Learning for Object Pile Manipulation☆36Jan 29, 2024Updated 2 years ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆63Oct 22, 2024Updated last year
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- Host CIFAR-10.2 Data Set☆13Sep 22, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆24Feb 5, 2024Updated 2 years ago
- [TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation☆14Sep 14, 2023Updated 2 years ago
- ☆11Nov 5, 2024Updated last year
- VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning☆141Oct 6, 2025Updated 6 months ago
- ☆14Jul 5, 2023Updated 2 years ago
- ☆12Sep 10, 2019Updated 6 years ago
- Python bindings for the Nvidia FleX simulator☆13Feb 4, 2019Updated 7 years ago
- ☆11May 24, 2024Updated last year
- Code for Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization.☆10Sep 28, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Visual Relationship Reasoning for Grasp Planning☆19May 22, 2025Updated 10 months ago
- [AAAI 2026 Oral] SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation☆39Apr 5, 2026Updated last week
- [CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos☆99Apr 14, 2025Updated last year
- Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation (NeurIPS 23)☆12May 7, 2025Updated 11 months ago
- [2024][MICCAI] LLM-guided Multi-modal Multiple Instance Learning for 5-year Overall Survival Prediction of Lung Cancer☆18Mar 30, 2026Updated 2 weeks ago
- [ICLR 2023] PyTorch code for DFPC: Data flow driven pruning of coupled channels without data.☆15Aug 25, 2023Updated 2 years ago
- This repository contains the code to our Paper: Medical Transformer for Multimodal Survival Prediction in Intensive Care - Integration of…☆19May 15, 2023Updated 2 years ago
- 🐝 | From Data to Prognosis: Embedding Multimodal Oncology Data for Precision Medicine☆39Feb 23, 2026Updated last month
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆18Jan 22, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR 24] This is official implication for our paper: ''CroSel: Cross Selection of Confident Pseudo Labels for Partial-Label Learning''.☆16Apr 27, 2025Updated 11 months ago
- A Framework for Symbolic MUsic Graph Explanations☆10Jul 30, 2025Updated 8 months ago
- "Visual Prompt Selection for In-Context Learning Segmentation Framework"☆15Dec 13, 2024Updated last year
- ☆35Nov 25, 2025Updated 4 months ago
- ☆12Jun 1, 2024Updated last year
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆14Nov 27, 2024Updated last year
- data augmentation alone can improve adversarial training☆15Mar 24, 2023Updated 3 years ago
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆23Nov 16, 2022Updated 3 years ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆41May 30, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- ☆13Oct 23, 2018Updated 7 years ago
- CaMML:Context-Aware MultiModal Learner for Large Models (ACL 2024 SAC Award)☆15May 21, 2025Updated 10 months ago
- Official Implementation of Towards Open Vocabulary Video Semantic Segmentation☆14Feb 27, 2025Updated last year
- ☆24Jun 5, 2025Updated 10 months ago
- ☆15Mar 21, 2025Updated last year
- The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)☆20Mar 9, 2026Updated last month