Victorwz/LLaVA-Llama-3

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Victorwz/LLaVA-Llama-3)

Victorwz / LLaVA-Llama-3

Reproduction of LLaVA-v1.5 based on Llama-3-8b LLM backbone.

☆64

Alternatives and similar repositories for LLaVA-Llama-3

Users that are interested in LLaVA-Llama-3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aimagelab / LLaVA-MORE
View on GitHub
[ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning
☆160Aug 8, 2025Updated 11 months ago
MajorDavidZhang / MCL
View on GitHub
code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning
☆20Jul 16, 2024Updated 2 years ago
mbzuai-oryx / LLaVA-pp
View on GitHub
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
☆842Aug 5, 2025Updated 11 months ago
THUNLP-MT / ActiView
View on GitHub
☆11Dec 20, 2024Updated last year
kaist-ami / BEAF
View on GitHub
[ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"
☆22Mar 26, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
FreedomIntelligence / FastLLM
View on GitHub
Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];
☆41Jan 4, 2024Updated 2 years ago
luogen1996 / LLaVA-HR
View on GitHub
[ICLR2025] LLaVA-HR: High-Resolution Large Language-Vision Assistant
☆249Aug 14, 2024Updated last year
TempleX98 / MoVA
View on GitHub
[NeurIPS 2024] MoVA: Adapting Mixture of Vision Experts to Multimodal Context
☆174Sep 25, 2024Updated last year
WildVision-AI / WildVision-Bench
View on GitHub
☆17Oct 21, 2024Updated last year
Tree-Shu-Zhao / RebQ.pytorch
View on GitHub
This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati…
☆12Aug 13, 2024Updated last year
shrimantasatpati / Microsoft-Phi-2-Streamlit
View on GitHub
Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…
☆15May 1, 2024Updated 2 years ago
Victorwz / LLaVA-Unified
View on GitHub
☆23Aug 27, 2025Updated 11 months ago
Victorwz / LaViA
View on GitHub
☆10Jul 13, 2024Updated 2 years ago
weixuansun / GETAM
View on GitHub
☆10Nov 29, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
GraphPKU / CoI
View on GitHub
Chain of Images for Intuitively Reasoning
☆10Nov 29, 2023Updated 2 years ago
anonymous-sushi-armadillo / fast_is_better_than_free_imagenet
View on GitHub
☆10Sep 25, 2019Updated 6 years ago
Richar-Du / Virgo
View on GitHub
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆20May 27, 2025Updated last year
MajorDavidZhang / Generalization_unified_VLM
View on GitHub
☆24May 23, 2025Updated last year
leeguandong / XrayLLaVA
View on GitHub
基于LLaVA1.6微调的Xray识别的多模态大模型
☆10Oct 22, 2024Updated last year
lixinustc / GraphAdapter
View on GitHub
The efficient tuning method for VLMs
☆83Mar 10, 2024Updated 2 years ago
djm209 / HSTGODE
View on GitHub
HSTGODE code
☆11Nov 26, 2023Updated 2 years ago
sanketx / AL-foundation-models
View on GitHub
Active Learning in the era of Foundation Models
☆14Apr 16, 2025Updated last year
DTennant / Incremental-Generalized-Category-Discovery
View on GitHub
☆15Oct 27, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
AgentMaker / WebAI.js
View on GitHub
A simple Web AI model deployment tool using JavaScript based on OpenCV.js and ONNXRuntime
☆58Jul 8, 2024Updated 2 years ago
ant-research / DreamLIP
View on GitHub
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
☆138May 8, 2025Updated last year
nlp-uoregon / ullme
View on GitHub
☆20Apr 8, 2025Updated last year
flyfj / VisionToolbox
View on GitHub
a set of tools for computer vision processing
☆18Jul 9, 2016Updated 10 years ago
OpenGVLab / LAMM
View on GitHub
[NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents
☆317Apr 16, 2024Updated 2 years ago
Q-Future / Chinese-Q-Bench
View on GitHub
[WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese)，包含中文版【底层视觉问答】和【底层视觉描述】数据集，以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the futu…
☆24Jan 7, 2024Updated 2 years ago
KaiyangZhou / on-device-dg
View on GitHub
On-Device Domain Generalization
☆47Nov 9, 2022Updated 3 years ago
Tencent / Freeze-Omni
View on GitHub
The official implement of Freeze-Omni.
☆16Jul 10, 2025Updated last year
roudimit / c2kd
View on GitHub
Code for the C2KD paper (ICASSP 2023)
☆20May 15, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Kyunnilee / visual_puzzles
View on GitHub
🧩 Official code repository for “Puzzled by Puzzles: When Vision-Language Models Can’t Take a Hint.”
☆15Sep 22, 2025Updated 10 months ago
gauss5930 / AlpaGasus2-QLoRA
View on GitHub
This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!
☆15Nov 22, 2023Updated 2 years ago
Abhiram4572 / mi_bart
View on GitHub
☆13Oct 23, 2024Updated last year
RunpeiDong / DreamLLM
View on GitHub
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
☆462Dec 2, 2024Updated last year
Z-Zheng / dynamic_highres_poverty
View on GitHub
Dynamic, high-resolution poverty measurement in data-scarce environments
☆11Dec 8, 2024Updated last year
LLaVA-VL / LLaVA-NeXT
View on GitHub
☆4,712Jun 15, 2026Updated last month
nanfangAlan / FSRFER
View on GitHub
a TensorFlow implementation of the paper "Feature Super-Resolution Based Facial Expression Recognition for Multi-scale Low-Resolution Ima…
☆13Nov 30, 2021Updated 4 years ago