tmlr-group/WCA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tmlr-group/WCA)

tmlr-group / WCA

[ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"

☆59

Alternatives and similar repositories for WCA

Users that are interested in WCA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tmlr-group / SMM
View on GitHub
[ICML 2024 Spotlight] "Sample-specific Masks for Visual Reprogramming-based Prompting"
☆12Dec 20, 2024Updated last year
JinhaoLee / WCA
View on GitHub
[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
☆19Mar 23, 2026Updated 4 months ago
HopLee6 / VJMHT-PyTorch
View on GitHub
Pytorch implementation for "Video Joint Modelling Based on Hierarchical Transformer for Co-summarization"
☆15Aug 24, 2025Updated 11 months ago
tmlr-group / PART
View on GitHub
[ICML 2024] "Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training"
☆17Jun 4, 2024Updated 2 years ago
HopLee6 / RRIN-PyTorch
View on GitHub
PyTorch Implementation of "Video Frame Interpolation via Residue Refinement"
☆69May 16, 2020Updated 6 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
HopLee6 / SSPVS-PyTorch
View on GitHub
Pytorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"
☆36Aug 26, 2025Updated 10 months ago
yic20 / CoMC
View on GitHub
[ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition
☆17Jul 9, 2024Updated 2 years ago
emu1729 / GIST
View on GitHub
Generating Image Specific Text
☆29Aug 14, 2023Updated 2 years ago
tmlr-group / NegLabel
View on GitHub
[ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"
☆21Oct 23, 2024Updated last year
tmlr-group / BayesianLM
View on GitHub
[NeurIPS 2024 Oral] "Bayesian-Guided Label Mapping for Visual Reprogramming"
☆12Dec 20, 2024Updated last year
ThomasWangY / 2024-AAAI-HPT
View on GitHub
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)
☆75Feb 3, 2025Updated last year
StanfordMIMI / villa
View on GitHub
[ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data
☆45Oct 15, 2023Updated 2 years ago
changyanchuan / SARN
View on GitHub
SARN: Spatial Structure-Aware Road Network Embedding via Graph Contrastive Learning - EDBT 2023
☆20Jun 30, 2026Updated 3 weeks ago
suzy0223 / STSM
View on GitHub
Official code for the paper 'Spatial-temporal Forecasting for Regions without Observations'
☆15Nov 9, 2025Updated 8 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tmlr-group / SCT
View on GitHub
[NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection"
☆13Oct 28, 2024Updated last year
mainaksingha01 / ODG-CLIP
View on GitHub
☆21Oct 9, 2025Updated 9 months ago
michiganleon / ReCLIP_WACV
View on GitHub
☆18Mar 4, 2024Updated 2 years ago
QizhouWang / MAIL
View on GitHub
source code for NeurIPS21 paper robabilistic Margins for Instance Reweighting in Adversarial Training
☆11Apr 28, 2022Updated 4 years ago
vladan-stojnic / ZLaP
View on GitHub
Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)
☆45Jul 23, 2024Updated 2 years ago
dmoltisanti / air-cvpr23
View on GitHub
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13May 25, 2023Updated 3 years ago
CHENGY12 / PLOT
View on GitHub
[ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models
☆177Dec 14, 2023Updated 2 years ago
Saehyung-Lee / PlugIR
View on GitHub
Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)
☆34Mar 24, 2025Updated last year
dhg-wei / TOPA
View on GitHub
(NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
☆29Sep 27, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
XueJiang16 / NegLabel
View on GitHub
[ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"
☆30Oct 23, 2024Updated last year
salesforce / RRL
View on GitHub
☆20May 1, 2025Updated last year
wwangwitsel / ConfDiff
View on GitHub
[NeurIPS'23] Binary Classification with Confidence Difference
☆10May 13, 2024Updated 2 years ago
Go2Heart / EchoSight
View on GitHub
[EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge.
☆90Jan 19, 2026Updated 6 months ago
ExplainableML / WaffleCLIP
View on GitHub
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…
☆61Jul 8, 2023Updated 3 years ago
tmlr-group / ZS-NTTA
View on GitHub
[ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"
☆13Feb 22, 2025Updated last year
edchengg / oven_eval
View on GitHub
ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities
☆44Jun 7, 2025Updated last year
minglllli / CLIPFit
View on GitHub
[EMNLP 2024] Implementation of vision-language model fine-tuning via simple parameter-efficient modification
☆19Nov 24, 2024Updated last year
McGill-NLP / diffusion-itm
View on GitHub
Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"
☆33Mar 15, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
edchengg / infoseek_eval
View on GitHub
EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions
☆26May 30, 2024Updated 2 years ago
YCaigogogo / CODER
View on GitHub
☆22Apr 27, 2024Updated 2 years ago
Gank0078 / FineSSL
View on GitHub
Pytorch implementation for "Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning" (ICML 2024)
☆27May 11, 2025Updated last year
RoyalSkye / ATCL
View on GitHub
[NeurIPS 2022] "Adversarial Training with Complementary Labels: On the Benefit of Gradually Informative Attacks"
☆13Nov 11, 2022Updated 3 years ago
ytaek-oh / vl_compo
View on GitHub
☆10Jul 5, 2024Updated 2 years ago
zhangce01 / DualAdapter
View on GitHub
Code for Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models
☆25Oct 29, 2024Updated last year
OPTML-Group / DP4TL
View on GitHub
[NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…
☆14Oct 12, 2023Updated 2 years ago