zhuole1025/LLMs_as_Visual_Explainers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhuole1025/LLMs_as_Visual_Explainers)

zhuole1025 / LLMs_as_Visual_Explainers

Official Repository for "LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions"

☆15

Alternatives and similar repositories for LLMs_as_Visual_Explainers

Users that are interested in LLMs_as_Visual_Explainers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CompVis / DisCLIP
View on GitHub
[AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?
☆26Aug 5, 2025Updated 11 months ago
HHenryD / TAP
View on GitHub
[ICLR'25] Official repository of paper titled "Tree of Attributes Prompt Learning for Vision-Language Models".
☆20Oct 15, 2025Updated 9 months ago
zhuole1025 / Structured-Visuals
View on GitHub
[ICLR2026] Factuality Matters: When Image Generation and Editing Meet Structured Visuals
☆37Nov 13, 2025Updated 8 months ago
r-three / realistic_evaluation_of_model_merging_for_compositional_generalization
View on GitHub
☆12Feb 11, 2026Updated 5 months ago
francisol / GZUthesis-template
View on GitHub
贵州大学研究生学位论文模板
☆12Apr 29, 2026Updated 2 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
zhanghr2001 / PromptTA
View on GitHub
Source-free Domain Generalization
☆16Sep 24, 2024Updated last year
Dom3442 / leafonlysam
View on GitHub
☆11Dec 6, 2024Updated last year
shiming-chen / LaZSL
View on GitHub
Official implementations of our LaZSL (ICCV'25)
☆45Jul 13, 2025Updated last year
JREion / DPC
View on GitHub
[CVPR 2025] Official PyTorch Code for "DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models"
☆48Apr 28, 2026Updated 2 months ago
ExplainableML / WaffleCLIP
View on GitHub
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…
☆61Jul 8, 2023Updated 3 years ago
zhaohengz / LLaMP
View on GitHub
Official Repository for CVPR 2024 Paper: "Large Language Models are Good Prompt Learners for Low-Shot Image Classification"
☆45Jul 1, 2024Updated 2 years ago
William-wAng618 / M2PT
View on GitHub
Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning
☆29Mar 23, 2025Updated last year
bkocis / home-surveillance-with-multimodal-llms
View on GitHub
An example of using multimodal LLMs to processpide feed from camera and get image description
☆15Mar 11, 2024Updated 2 years ago
glchau / TOTEM_for_EEG_code
View on GitHub
Code for using TOTEM on EEG data
☆15Sep 24, 2025Updated 9 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
zerocstaker / constrained_ape
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆12Oct 10, 2020Updated 5 years ago
jingwu6 / LM_AG
View on GitHub
☆18Apr 8, 2024Updated 2 years ago
kochbj / Reduced_Reused_Recycled
View on GitHub
Github for "Reduced, Reused and Recycled" (NeurIPS 2021 Best Paper, D&B Track)
☆17Jan 8, 2022Updated 4 years ago
caopulan / CVPR24_Listener
View on GitHub
☆12Feb 2, 2024Updated 2 years ago
wns823 / NMT_SSP
View on GitHub
NMT with ssp
☆11Oct 28, 2021Updated 4 years ago
RMSnow / HAT
View on GitHub
Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.
☆14Mar 22, 2023Updated 3 years ago
LyWang12 / CUTI-Domain
View on GitHub
☆15Feb 11, 2025Updated last year
emu1729 / GIST
View on GitHub
Generating Image Specific Text
☆29Aug 14, 2023Updated 2 years ago
Hellcatzm / SSD_Realization_MXNet
View on GitHub
MXNet复现SSD目标检测网络
☆12Apr 2, 2019Updated 7 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
sarrouti / VQGR
View on GitHub
Visual Question Generation
☆11Aug 20, 2024Updated last year
qingma2016 / 3DT-Net
View on GitHub
☆19Jan 20, 2023Updated 3 years ago
coderlemon17 / LemonScripts
View on GitHub
Here is the repo for public scripts.
☆12Jul 16, 2022Updated 4 years ago
Egg-Hu / LoRA-Recycle
View on GitHub
[CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
☆14Jun 20, 2025Updated last year
Andrew0613 / PICABench
View on GitHub
PICABench: How Far Are We from Physically Realistic Image Editing?
☆39Nov 5, 2025Updated 8 months ago
yeeeqichen / FiTs
View on GitHub
[AAAI 2023] Official implementation of FiTs: Fine-grained Two-stage Training for Knowledge Base Question Answering
☆11Mar 10, 2023Updated 3 years ago
ankur219 / Logo-Detection-SSD
View on GitHub
Logo detection in images using SSD
☆10Jul 13, 2018Updated 8 years ago
appletea233 / LLaVA-ST
View on GitHub
[CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
☆84Jul 4, 2025Updated last year
Sample-design-alt / DANet
View on GitHub
☆26May 24, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ictnlp / NA-MNMT
View on GitHub
Source code for "Importance-based Neuron Allocation for Multilingual Neural Machine Translation"
☆12Sep 15, 2021Updated 4 years ago
QxGeng / DEA-Net
View on GitHub
☆15Dec 18, 2021Updated 4 years ago
ImKeTT / ZeroGen
View on GitHub
[NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation
☆14Oct 7, 2023Updated 2 years ago
xiaomin418 / CFSum
View on GitHub
☆13Jan 9, 2024Updated 2 years ago
Picsart-AI-Research / Mask-Matching-Transformer
View on GitHub
☆15Jan 12, 2023Updated 3 years ago
H-TayyarMadabushi / AStitchInLanguageModels
View on GitHub
Data and Baselines for AStitchInLanguageModels dataset
☆13Oct 31, 2022Updated 3 years ago
zzzx1224 / EBTSA-ICLR2023
View on GitHub
☆12Feb 17, 2025Updated last year