emu1729/GIST

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/emu1729/GIST)

emu1729 / GIST

Generating Image Specific Text

☆29

Alternatives and similar repositories for GIST

Users that are interested in GIST are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BatsResearch / fudd
View on GitHub
Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification
☆11Nov 15, 2023Updated 2 years ago
FactoDeepLearning / MultitaskVLFM
View on GitHub
☆25Aug 1, 2023Updated 2 years ago
shubhamprshr27 / NeglectedTailsVLM
View on GitHub
This repository houses the code for the paper - "The Neglected of VLMs"
☆30Dec 31, 2025Updated 6 months ago
dmoltisanti / air-cvpr23
View on GitHub
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13May 25, 2023Updated 3 years ago
tmlr-group / WCA
View on GitHub
[ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"
☆59Sep 3, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
BeierZhu / GLA
View on GitHub
[NeurIPS 2023] Generalized Logit Adjustment
☆40Apr 21, 2024Updated 2 years ago
mayug / VDT-Adapter
View on GitHub
This repository contains the code and datasets for our ICCV-W paper 'Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts…
☆30Feb 21, 2024Updated 2 years ago
umd-huang-lab / perceptionCLIP
View on GitHub
Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"
☆80May 5, 2024Updated 2 years ago
ExplainableML / WaffleCLIP
View on GitHub
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…
☆61Jul 8, 2023Updated 3 years ago
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
YueYANG1996 / LaBo
View on GitHub
CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification
☆108May 28, 2024Updated 2 years ago
Jingchensun / Prompt-Adapter
View on GitHub
Prompt Tuning based Adapter for Vision-Language Model Adaption
☆16Sep 1, 2023Updated 2 years ago
vishaal27 / SuS-X
View on GitHub
Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]
☆104Aug 22, 2023Updated 2 years ago
RAIVNLab / neural-priming
View on GitHub
Code repository for the paper - "Neural Priming for Sample-Efficient Adaptation"
☆14Nov 13, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ananthu-aniraj / masking_strategies_bias_removal
View on GitHub
Masking Strategies for Background Bias Removal in Computer Vision Models (ICCVW OODCV 2023 paper)
☆16Jul 3, 2025Updated last year
mightyzau / RegionBLIP
View on GitHub
☆59Aug 7, 2023Updated 2 years ago
apple / ml-ogen
View on GitHub
☆13Apr 7, 2024Updated 2 years ago
PRIS-CV / Top-Down-Spatial-Attention-Loss
View on GitHub
Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features
☆12Mar 2, 2021Updated 5 years ago
gengyuanmax / MeVTR
View on GitHub
Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'
☆20Feb 16, 2024Updated 2 years ago
Dichao-Liu / CMAL
View on GitHub
☆48Apr 9, 2023Updated 3 years ago
wangyu-ustc / LM4CV
View on GitHub
The official implementation of the paper **Learning Concise and Descriptive Attributes for Visual Recognition**
☆49Dec 14, 2023Updated 2 years ago
microsoft / klite
View on GitHub
[NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222
☆54Jun 12, 2023Updated 3 years ago
yangyangyang127 / APE
View on GitHub
[ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"
☆150Apr 21, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Mia-YatingYu / STDD
View on GitHub
[AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP
☆23Aug 5, 2025Updated 11 months ago
Gank0078 / FineSSL
View on GitHub
Pytorch implementation for "Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning" (ICML 2024)
☆27May 11, 2025Updated last year
ramdrop / edgevl
View on GitHub
Offcial code for the ECCV2024 paper "Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities"
☆26Oct 1, 2024Updated last year
UCSB-AI / Discffusion
View on GitHub
Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
☆29Apr 27, 2024Updated 2 years ago
yhygao / Explicd
View on GitHub
☆18Sep 19, 2024Updated last year
ml-jku / semantic-image-text-alignment
View on GitHub
☆25Jul 10, 2023Updated 3 years ago
yangbang18 / MultiCapCLIP
View on GitHub
(ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
☆36Aug 8, 2024Updated last year
ethanlshen / HierNet
View on GitHub
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…
☆23Nov 8, 2023Updated 2 years ago
sarahpratt / CuPL
View on GitHub
☆203May 10, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
SeitaroShinagawa / CLIP-visualization
View on GitHub
Attention visualization in CLIP
☆17Dec 7, 2022Updated 3 years ago
LiShuo1001 / LF2CS
View on GitHub
The code of Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space
☆15Jul 12, 2022Updated 4 years ago
07Agarg / HAF
View on GitHub
Code for the Paper Learning Hierarchy Aware Features for Reducing Mistake Severity, accepted in ECCV 2022
☆15Dec 16, 2022Updated 3 years ago
zhuole1025 / LLMs_as_Visual_Explainers
View on GitHub
Official Repository for "LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions"
☆15Apr 20, 2025Updated last year
PRIS-CV / Making-a-Bird-AI-Expert-Work-for-You-and-Me
View on GitHub
Code release for "Making a Bird AI Expert Work for You and Me (TPAMI 2023)".
☆16May 4, 2023Updated 3 years ago
bladewaltz1 / PromptSwitch
View on GitHub
☆30Aug 14, 2023Updated 2 years ago
Becomebright / MTV
View on GitHub
Revisiting Multi-Task Visual Representation Learning
☆22Jan 21, 2026Updated 6 months ago