adobe-research/llava-score

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/adobe-research/llava-score)

adobe-research / llava-score

☆11

Alternatives and similar repositories for llava-score

Users that are interested in llava-score are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

arijitray1993 / COLA
View on GitHub
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25May 14, 2026Updated 2 months ago
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
FeiElysia / awesome-zero-shot-captioning
View on GitHub
A curated list of zero-shot captioning papers
☆24Aug 26, 2023Updated 2 years ago
iOPENCap / awesome-unimodal-training
View on GitHub
text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)
☆12Oct 15, 2024Updated last year
sjz5202 / LLaVA-Reward
View on GitHub
Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
☆26Jul 30, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
chancharikmitra / SAVs
View on GitHub
Official Codebase for "Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers"
☆26Jun 7, 2025Updated last year
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Sep 30, 2023Updated 2 years ago
cfeng16 / GPS2Pix
View on GitHub
[CVPR 2025] GPS as a Control Signal for Image Generation
☆25Mar 18, 2025Updated last year
fuxianghuang1 / Multimodal-Composite-Editing-and-Retrieval
View on GitHub
Multimodal-Composite-Editing-and-Retrieval-update
☆35Oct 13, 2025Updated 9 months ago
GXYM / VCapsBench
View on GitHub
VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation
☆20Jun 2, 2025Updated last year
mengchuang123 / VASparse-github
View on GitHub
[CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification
☆50Mar 24, 2025Updated last year
google-deepmind / wyd-benchmark
View on GitHub
☆28Mar 3, 2025Updated last year
linzhiqiu / visual_gpt_score
View on GitHub
VisualGPTScore for visio-linguistic reasoning
☆27Oct 7, 2023Updated 2 years ago
VidCapBench / VidCapBench
View on GitHub
☆13May 17, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ChenyuHeidiZhang / VL-commonsense
View on GitHub
☆14May 23, 2022Updated 4 years ago
TerminologyHub / termhub-in-5-minutes
View on GitHub
Developer project for getting basic API integrations working in under 5 minutes
☆11May 22, 2026Updated 2 months ago
dhg-wei / TOPA
View on GitHub
(NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
☆29Sep 27, 2024Updated last year
alycialee / beyond-scale-language-data-diversity
View on GitHub
☆13Updated this week
cvl-umass / ssl-evaluation
View on GitHub
Evaluation of semi-supervised learning on challenging datasets
☆38Dec 21, 2021Updated 4 years ago
nishadsinghi / sc-genrm-scaling
View on GitHub
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15Oct 31, 2025Updated 8 months ago
shubhamprshr27 / NeglectedTailsVLM
View on GitHub
This repository houses the code for the paper - "The Neglected of VLMs"
☆30Dec 31, 2025Updated 6 months ago
saibr / hypvl
View on GitHub
This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…
☆21Jul 5, 2024Updated 2 years ago
RAIVNLab / CREPE
View on GitHub
[CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?
☆35Apr 27, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
LgQu / TIGeR
View on GitHub
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16Updated this week
YeeZ93 / Awesome-Object-Centric-Learning
View on GitHub
A curated list of researches in object-centric learning
☆11Oct 14, 2024Updated last year
Heidelberg-NLP / VALSE
View on GitHub
Data repository for the VALSE benchmark.
☆40Feb 15, 2024Updated 2 years ago
IVY-LVLM / CODE
View on GitHub
Official Implementation of CODE
☆17Sep 26, 2024Updated last year
aszala / VPEval
View on GitHub
VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆45Nov 29, 2023Updated 2 years ago
sjz5202 / DisenLink
View on GitHub
Implementation for paper "Link Prediction on Heterophilic Graphs via Disentangled Representation Learning"
☆13Aug 26, 2022Updated 3 years ago
baaaad / ECE
View on GitHub
[ECCV'22 Poster] Explicit Image Caption Editing
☆22Nov 30, 2022Updated 3 years ago
jameslahm / SCPNet
View on GitHub
Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels [CVPR 2023]
☆14Sep 23, 2023Updated 2 years ago
pkien01 / draw-on-face
View on GitHub
Coloring lips and drawing glasses on faces in custom images or live webcam
☆11Sep 10, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
BeierZhu / GLA
View on GitHub
[NeurIPS 2023] Generalized Logit Adjustment
☆40Apr 21, 2024Updated 2 years ago
orrzohar / Video-STaR
View on GitHub
[ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
☆72Jul 10, 2024Updated 2 years ago
mtli / DLGPUBench
View on GitHub
Code for Deep Learning GPU Benchmark: A Latency-Based Approach
☆14Mar 21, 2025Updated last year
Kai-46 / colmap_in_docker
View on GitHub
install colmap in a docker
☆12Apr 24, 2020Updated 6 years ago
kaist-ami / BEAF
View on GitHub
[ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"
☆22Mar 26, 2025Updated last year
HeimingX / TAG
View on GitHub
Official code for Attention-driven GUI Grounding, AAAI2025
☆15Dec 17, 2024Updated last year
Victorwz / VaLM
View on GitHub
VaLM: Visually-augmented Language Modeling. ICLR 2023.
☆56Mar 6, 2023Updated 3 years ago