☆22Aug 27, 2025Updated 7 months ago
Alternatives and similar repositories for LLaVA-Unified
Users that are interested in LLaVA-Unified are comparing it to the libraries listed below.
- ☆18Aug 7, 2024Updated last year
- ☆10Jul 13, 2024Updated last year
- ☆47Nov 8, 2024Updated last year
- ☆37Sep 16, 2024Updated last year
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.☆15May 26, 2025Updated 10 months ago
- ☆11Feb 25, 2024Updated 2 years ago
- Placeholder☆10Jul 17, 2023Updated 2 years ago
- Official implementation of EMNLP 2021 Paper "Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent Variables"☆12May 15, 2023Updated 2 years ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- ☆12Aug 31, 2021Updated 4 years ago
- Visual Looming: Frontal obstacle avoidance using monocular camera for UAV☆15Apr 23, 2017Updated 8 years ago
- ☆31Jun 9, 2025Updated 10 months ago
- ☆15May 23, 2022Updated 3 years ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆55Apr 3, 2026Updated 2 weeks ago
- Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking).☆13Mar 24, 2023Updated 3 years ago
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 3 months ago
- Official PyTorch implementation of paper "Schema Inference for Interpretable Image Classification" (ICLR 2023)☆15Apr 6, 2023Updated 3 years ago
- Official repository for the paper "Random Shuffle Transformer for Image Restoration".☆17Jan 9, 2024Updated 2 years ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…☆10Jun 16, 2024Updated last year
- Code release for VTW (AAAI 2025 Oral)☆66Nov 4, 2025Updated 5 months ago
- ☆17Oct 20, 2020Updated 5 years ago
- Official Implementation (PyTorch) of "Super-class guided Transformer for Zero-Shot Attribute Classification", AAAI 2025☆15Jan 15, 2025Updated last year
- ☆12Dec 15, 2023Updated 2 years ago
- The official implementation of "Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings"☆18Dec 5, 2024Updated last year
- Adapt MLLMs to Domains via Post-Training (EMNLP 2025 Findings)☆14Nov 11, 2025Updated 5 months ago
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆12Mar 21, 2022Updated 4 years ago
- The code of WEAKLY SUPERVISED NUCLEI SEGMENTATION VIA INSTANCE LEARNING☆17Apr 10, 2023Updated 3 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Jan 14, 2025Updated last year
- [ECCV '24] On the Utility of 3D Hand Poses for Action Recognition☆18Jun 8, 2025Updated 10 months ago
- [CIKM 2022] Towards Automated Over-Sampling for Imbalanced Classification☆10Mar 20, 2023Updated 3 years ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆17Dec 23, 2021Updated 4 years ago
- Official repository of ECCV 2024 paper - "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization"☆19Aug 23, 2024Updated last year
- CLIPCleaner: Cleaning Noisy Labels with CLIP (ACM MM2024)☆15Apr 28, 2025Updated 11 months ago
- [CVPR' 25] Official implementation of the paper "Pseudo Visible Feature Fine-Grained Fusion for Thermal Object Detection"☆23Aug 29, 2025Updated 7 months ago
- Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding☆40Mar 16, 2025Updated last year
- rebert model code based on fairseq☆15Feb 28, 2021Updated 5 years ago
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension☆53Dec 3, 2024Updated last year
- Sentiment polarity annotations dataset☆26Nov 28, 2017Updated 8 years ago