☆21 Aug 27, 2025 Updated 7 months ago
Alternatives and similar repositories for LLaVA-Unified
Users that are interested in LLaVA-Unified are comparing it to the libraries listed below.
- ☆17 Aug 7, 2024 Updated last year
- ☆47 Nov 8, 2024 Updated last year
- ☆37 Sep 16, 2024 Updated last year
- Official implementation of SIGIR 2022 Paper "Task-Oriented Dialogue System as Natural Language Generation". ☆14 Apr 6, 2022 Updated 3 years ago
- ☆11 Feb 25, 2024 Updated 2 years ago
- Official repository for U-SAM (Interspeech 2025) ☆26 Jun 3, 2025 Updated 9 months ago
- Code for the paper "Representing Spatial Trajectories as Distributions" ☆13 Jan 17, 2023 Updated 3 years ago
- Official implementation of "PathReasoner-R1: Instilling Structured Reasoning into Pathology Vision-Language Model via Knowledge-Guided Po… ☆22 Jan 28, 2026 Updated 2 months ago
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models ☆52 Jun 12, 2025 Updated 9 months ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning" ☆51 Jan 30, 2026 Updated 2 months ago
- Placeholder ☆10 Jul 17, 2023 Updated 2 years ago
- Official implementation of EMNLP 2021 Paper "Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent Variables" ☆12 May 15, 2023 Updated 2 years ago
- [ICCV 2025] Dynamic-VLM ☆28 Dec 16, 2024 Updated last year
- This package is EasyOCR-based optical character recognition. Unlike EasyOCR, the package uses a pre-saved with onnx language models, so i… ☆13 Mar 9, 2025 Updated last year
- ☆12 Aug 31, 2021 Updated 4 years ago
- Seamlessly shorten songs using AWS Lambda ☆19 Apr 28, 2019 Updated 6 years ago
- ☆17 Jun 8, 2019 Updated 6 years ago
- ☆15 Jul 5, 2022 Updated 3 years ago
- ☆30 Jun 9, 2025 Updated 9 months ago
- Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking). ☆13 Mar 24, 2023 Updated 3 years ago
- ☆14 Jul 14, 2025 Updated 8 months ago
- Official PyTorch implementation of paper "Schema Inference for Interpretable Image Classification" (ICLR 2023) ☆15 Apr 6, 2023 Updated 2 years ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin… ☆10 Jun 16, 2024 Updated last year
- Differentiable Clustering with Perturbed Random Forests, NeurIPS2023 ☆13 Oct 16, 2023 Updated 2 years ago
- ☆13 Jul 23, 2024 Updated last year
- The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for E… ☆29 Updated this week
- ☆12 Dec 15, 2023 Updated 2 years ago
- ☆30 Jul 24, 2025 Updated 8 months ago
- ☆13 Sep 25, 2023 Updated 2 years ago
- ☆37 Nov 8, 2024 Updated last year
- Code for "Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation" (Findings of ACL 2024) ☆16 Jul 4, 2024 Updated last year
- Offline RL experiments ☆15 Oct 1, 2022 Updated 3 years ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs ☆26 Jan 14, 2025 Updated last year
- [ECCV '24] On the Utility of 3D Hand Poses for Action Recognition ☆18 Jun 8, 2025 Updated 9 months ago
- Official implementation of Stitched Image Quality evaluator (SIQE) ☆22 Sep 24, 2021 Updated 4 years ago
- [CIKM 2022] Towards Automated Over-Sampling for Imbalanced Classification ☆10 Mar 20, 2023 Updated 3 years ago
- Official repository of ECCV 2024 paper - "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization" ☆19 Aug 23, 2024 Updated last year
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u… ☆25 Jun 4, 2025 Updated 9 months ago
- Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding ☆40 Mar 16, 2025 Updated last year