[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
☆19Mar 23, 2026Updated 3 weeks ago
Alternatives and similar repositories for WCA
Users that are interested in WCA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"☆58Sep 3, 2024Updated last year
- Pytorch implementation for "Video Joint Modelling Based on Hierarchical Transformer for Co-summarization"☆15Aug 24, 2025Updated 7 months ago
- [ICML 2024 Spotlight] "Sample-specific Masks for Visual Reprogramming-based Prompting"☆12Dec 20, 2024Updated last year
- [ICML 2024] "Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training"☆17Jun 4, 2024Updated last year
- Dataset for "Video Crowd Localization with Multi-focus Gaussian Neighborhood Attention and a Large-Scale Benchmark"☆35Dec 9, 2025Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for ACM MM 2023 paper - Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning☆14Jan 19, 2024Updated 2 years ago
- Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports☆41Jan 3, 2026Updated 3 months ago
- ☆43May 31, 2023Updated 2 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆20May 8, 2025Updated 11 months ago
- Official code Implementation of "Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP" (AAA…☆21Dec 17, 2024Updated last year
- Code for Doubly deformable aggregation of covariance matrices for few-shot segmentation☆16Oct 25, 2022Updated 3 years ago
- ☆20Dec 17, 2024Updated last year
- Repo for "Uncertain Multimodal Intention and Emotion Understanding in the Wild"☆17Oct 20, 2025Updated 5 months ago
- ☆18Apr 7, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- code for studying OpenAI's CLIP explainability☆39Jan 7, 2022Updated 4 years ago
- ☆201May 10, 2023Updated 2 years ago
- Backport of multiprocessing.shared_memory in Python 3.8☆12Jan 5, 2024Updated 2 years ago
- CVPR2026☆29Sep 18, 2025Updated 7 months ago
- Code for WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge☆17Dec 31, 2024Updated last year
- [Symmetry 2022] Code Release of PointSCNet: Point Cloud Structure and Correlation Learning based on Space Filling Curve guided Sampling☆13Feb 24, 2022Updated 4 years ago
- Under Construction☆11Nov 25, 2021Updated 4 years ago
- Code for MInD: Multimodal Information Disentanglement☆19Dec 17, 2025Updated 4 months ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- PyTorch Implementation of the Sequential Multiagent Rollout algorithm☆11Jun 28, 2024Updated last year
- [ICCV 2023] Official repository of paper titled "Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?"☆27Sep 20, 2023Updated 2 years ago
- Official code for the paper 'Spatial-temporal Forecasting for Regions without Observations'☆13Nov 9, 2025Updated 5 months ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆21Sep 24, 2025Updated 6 months ago
- The specific details of the AAAI2025 paper "Enriching Multimodal Sentiment Analysis Through Textual Emotional Descriptions of Visual-Audi…☆31Feb 27, 2025Updated last year
- Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024☆108Jun 26, 2025Updated 9 months ago
- ☆10Jun 13, 2023Updated 2 years ago
- PyTorch code for the paper "CrossTransformers: spatially-aware few-shot transfer"☆25Dec 20, 2020Updated 5 years ago
- A curated list of papers & resources linked to concept learning☆12Aug 9, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Stream tweets with React, Express, Socket.io and Twitter☆11Apr 6, 2018Updated 8 years ago
- [NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection"☆13Oct 28, 2024Updated last year
- (TPAMI'2024) ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation☆22Aug 8, 2024Updated last year
- ☆12Apr 12, 2026Updated last week
- ☆20Aug 22, 2024Updated last year
- [ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"☆13Feb 15, 2025Updated last year
- Multimodal Sentiment Analysis with Image-Text Interaction Network☆17Aug 31, 2023Updated 2 years ago