[ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"
☆58 · Sep 3, 2024 · Updated last year
Alternatives and similar repositories for WCA
Users who are interested in WCA are comparing it to the repositories listed below.
- [ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models ☆19 · Sep 3, 2024 · Updated last year
- [ICML 2024 Spotlight] "Sample-specific Masks for Visual Reprogramming-based Prompting" ☆12 · Dec 20, 2024 · Updated last year
- [ICML 2024] "Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training" ☆17 · Jun 4, 2024 · Updated last year
- PyTorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning" ☆36 · Aug 26, 2025 · Updated 6 months ago
- Dataset for "Video Crowd Localization with Multi-focus Gaussian Neighborhood Attention and a Large-Scale Benchmark" ☆35 · Dec 9, 2025 · Updated 3 months ago
- [ICML 2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition ☆16 · Jul 9, 2024 · Updated last year
- Generating Image Specific Text ☆29 · Aug 14, 2023 · Updated 2 years ago
- Official code for the paper 'Spatial-temporal Forecasting for Regions without Observations' ☆13 · Nov 9, 2025 · Updated 4 months ago
- Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports ☆41 · Jan 3, 2026 · Updated 2 months ago
- [ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models" ☆21 · Oct 23, 2024 · Updated last year
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data