[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
☆19 · Mar 23, 2026 · Updated last month
Alternatives and similar repositories for WCA
Users who are interested in WCA are comparing it to the repositories listed below.
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models" ☆59 · Sep 3, 2024 · Updated last year
- PyTorch implementation for "Video Joint Modelling Based on Hierarchical Transformer for Co-summarization" ☆15 · Aug 24, 2025 · Updated 8 months ago
- [ICML 2024 Spotlight] "Sample-specific Masks for Visual Reprogramming-based Prompting" ☆12 · Dec 20, 2024 · Updated last year
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models" ☆13 · Feb 22, 2025 · Updated last year
- [ICML 2024] "Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training" ☆17 · Jun 4, 2024 · Updated last year
- Dataset for "Video Crowd Localization with Multi-focus Gaussian Neighborhood Attention and a Large-Scale Benchmark" ☆36 · Dec 9, 2025 · Updated 5 months ago
- Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports ☆42 · Jan 3, 2026 · Updated 4 months ago
- ☆43 · May 31, 2023 · Updated 2 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning ☆21 · May 8, 2025 · Updated last year
- Official code Implementation of "Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP" (AAA… ☆21 · Dec 17, 2024 · Updated last year
- Repo for "Uncertain Multimodal Intention and Emotion Understanding in the Wild" ☆18 · Oct 20, 2025 · Updated 6 months ago
- code for studying OpenAI's CLIP explainability ☆39 · Jan 7, 2022 · Updated 4 years ago
- CVPR2026 ☆30 · Sep 18, 2025 · Updated 7 months ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data ☆13 · Sep 30, 2023 · Updated 2 years ago
- [ICCV 2023] Official repository of paper titled "Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?" ☆27 · Sep 20, 2023 · Updated 2 years ago
- Repository for the CVPR-2023 paper: StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning ☆66 · Jun 2, 2025 · Updated 11 months ago
- The specific details of the AAAI2025 paper "Enriching Multimodal Sentiment Analysis Through Textual Emotional Descriptions of Visual-Audi… ☆33 · Feb 27, 2025 · Updated last year
- PyTorch implementation of Expectation over Transformation ☆13 · Jul 18, 2025 · Updated 9 months ago
- ☆10 · Jun 13, 2023 · Updated 2 years ago
- PyTorch source code of ESPT method in AAAI 2023 ☆22 · Jul 23, 2023 · Updated 2 years ago
- PyTorch code for the paper "CrossTransformers: spatially-aware few-shot transfer" ☆25 · Dec 20, 2020 · Updated 5 years ago
- A curated list of papers & resources linked to concept learning ☆13 · Aug 9, 2023 · Updated 2 years ago
- [NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection" ☆13 · Oct 28, 2024 · Updated last year
- (TPAMI'2024) ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation ☆22 · Aug 8, 2024 · Updated last year
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024) ☆45 · Jul 23, 2024 · Updated last year
- This repository is a collection of awesome things about vision prompts, including papers, code, etc. ☆40 · Dec 22, 2023 · Updated 2 years ago
- ☆32 · May 22, 2025 · Updated 11 months ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners. ☆22 · Sep 24, 2025 · Updated 7 months ago
- PyTorch Implementation for InMaP ☆12 · Oct 28, 2023 · Updated 2 years ago
- [ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection ☆34 · Jul 23, 2025 · Updated 9 months ago
- Colorful Prompt Tuning for Pre-trained Vision-Language Models ☆49 · Nov 1, 2022 · Updated 3 years ago
- Implementation of Weakly Supervised Deep Detection Networks with PyTorch ☆12 · Dec 7, 2022 · Updated 3 years ago
- (TPAMI 2026) Complementary Text-Guided Attention for Zero-Shot Adversarial Robustness & (NeurIPS 2024) Text-Guided Attention is All Y… ☆20 · Mar 23, 2026 · Updated last month
- Codebase for TF-Mamba: Text-enhanced Fusion Mamba with Missing Modalities for Robust Multimodal Sentiment Analysis. The code has been reo… ☆33 · May 27, 2025 · Updated 11 months ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals". ☆17 · Dec 15, 2021 · Updated 4 years ago
- Implementation of the paper Unsupervised Learning of Video Representations using LSTMs ☆10 · Nov 24, 2017 · Updated 8 years ago
- (CVPR2026 Oral) ANTS: Adaptive Negative Textual Space Shaping for OOD Detection via Test-Time MLLM Understanding and Reasoning ☆44 · Apr 18, 2026 · Updated 3 weeks ago
- SARN: Spatial Structure-Aware Road Network Embedding via Graph Contrastive Learning - EDBT 2023 ☆20 · Aug 7, 2023 · Updated 2 years ago
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification ☆11 · Nov 15, 2023 · Updated 2 years ago