Victorwz/LLaVA-Unified

Victorwz / LLaVA-Unified

☆23

Alternatives and similar repositories for LLaVA-Unified

Users who are interested in LLaVA-Unified are comparing it to the repositories listed below.

rxtan2 / Koala-video-llm
☆37 · Sep 16, 2024 · Updated last year
duyichao / NPDA-KNN-ST
Official implementation of EMNLP'2022 paper "Non-Parametric Domain Adaptation for End-to-End Speech Translation"
☆11 · Oct 26, 2022 · Updated 3 years ago
Victorwz / tod_as_nlg
Official implementation of SIGIR 2022 Paper "Task-Oriented Dialogue System as Natural Language Generation".
☆14 · Apr 6, 2022 · Updated 4 years ago
yikee / Knowledge_Conflict
Resolving Knowledge Conflicts in Large Language Models, COLM 2024
☆18 · Oct 7, 2025 · Updated 9 months ago
Hon-Wong / ByteVideoLLM
[ICCV 2025] Dynamic-VLM
☆28 · Dec 16, 2024 · Updated last year
cybertronai / bflm
☆17 · Jun 8, 2019 · Updated 7 years ago
surfertas / gt-cp-2017-project
Visual Looming: Frontal obstacle avoidance using monocular camera for UAV
☆15 · Apr 23, 2017 · Updated 9 years ago
wacv-pcs / WACV-2023-Author-Kit
☆15 · Jul 5, 2022 · Updated 4 years ago
ChenyuHeidiZhang / VL-commonsense
☆14 · May 23, 2022 · Updated 4 years ago
e2crawfo / silot
Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking).
☆13 · Mar 24, 2023 · Updated 3 years ago
jcolano / llama3_single_gpu
☆13 · Jul 23, 2024 · Updated 2 years ago
szzexpoi / POEM
Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…
☆10 · Jun 16, 2024 · Updated 2 years ago
perladoubinsky / SemAug
[WACV 2024] Official implementation of the paper: Semantic Generative Augmentations for Few-shot Counting
☆13 · May 1, 2024 · Updated 2 years ago
tiankuan93 / C2FNet
☆17 · Oct 20, 2020 · Updated 5 years ago
duyichao / E2E-ST-TDA
Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"
☆17 · Dec 23, 2021 · Updated 4 years ago
yl3800 / TranSTR
☆12 · Dec 15, 2023 · Updated 2 years ago
sakibreza / ECCV24-HAT
Official repository of ECCV 2024 paper - "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization"
☆20 · Aug 23, 2024 · Updated last year
bigai-ai / QA-Synthesizer
Adapt MLLMs to Domains via Post-Training (EMNLP 2025 Findings)
☆14 · Nov 11, 2025 · Updated 8 months ago
JianqiangWan / VLPT-STD
Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)
☆12 · Mar 21, 2022 · Updated 4 years ago
QUVA-Lab / PIN
Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
☆26 · Jan 14, 2025 · Updated last year
maxreciprocate / offline
Offline RL experiments
☆15 · Oct 1, 2022 · Updated 3 years ago
microsoft / SparseMixer
Sparse Backpropagation for Mixture-of-Expert Training
☆30 · Jul 2, 2024 · Updated 2 years ago
Leezekun / MMSci
MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension
☆51 · Dec 3, 2024 · Updated last year
CheungZeeCn / fairseq
rebert model codes based on fairseq
☆15 · Feb 28, 2021 · Updated 5 years ago
daochenzha / autosmote
[CIKM 2022] Towards Automated Over-Sampling for Imbalanced Classification
☆10 · Mar 20, 2023 · Updated 3 years ago
CASIA-IVA-Lab / OPT_Questioner
Official PyTorch implementation of the paper "Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner"
☆15 · Aug 9, 2023 · Updated 2 years ago
EdinburghNLP / spot-data
Sentiment polarity annotations dataset
☆26 · Nov 28, 2017 · Updated 8 years ago
FangXiuwen / FSMAFL
paper code commit-fsmafl
☆10 · Mar 18, 2024 · Updated 2 years ago
jiangqn / KSTER
code and data for paper "Learning Kernel-Smoothed Machine Translation with Retrieved Examples"
☆24 · Mar 16, 2022 · Updated 4 years ago
OmriKaduri / vlm-interp
Code for paper: "What’s in the Image? A Deep-Dive into the Vision of Vision Language Models" (CVPR 2025)
☆18 · May 1, 2025 · Updated last year
Chaolei98 / FreeZAD
Code for “Training-Free Zero-Shot Temporal Action Detection with Vision-Language Models”
☆18 · Apr 27, 2026 · Updated 3 months ago
Victorwz / LLaVA-Llama-3
Reproduction of LLaVA-v1.5 based on Llama-3-8b LLM backbone.
☆64 · Oct 25, 2024 · Updated last year
CeeZh / SILVR
Official Implementation for "SiLVR: A Simple Language-based Video Reasoning Framework"
☆19 · Jan 18, 2026 · Updated 6 months ago
haoyGONG / LP-Diff
Code of paper "LP-Diff: Towards Improved Restoration of Real-World Degraded License Plate"
☆20 · Jun 22, 2025 · Updated last year
HITsz-TMG / Cognitive-Visual-Language-Mapper
The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens…
☆17 · Jan 24, 2025 · Updated last year
sammcj / vlm-ui
Web Interface for Vision Language Models Including InternVLM2
☆27 · Jul 29, 2024 · Updated 2 years ago
marsggbo / NAS-LID
[AAAI2023] NAS-LID: Efficient Neural Architecture Search with Local Intrinsic Dimension
☆17 · Dec 20, 2022 · Updated 3 years ago
Ethan-TZ / EulerNet
[SIGIR 2023] This is the official PyTorch implementation for the paper: "EulerNet: Adaptive Feature Interaction Learning via Euler’s Form…
☆18 · Jul 31, 2024 · Updated last year
ictnlp / PCFG-NAT
Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".
☆12 · Jan 4, 2024 · Updated 2 years ago