[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
☆19Sep 3, 2024Updated last year
Alternatives and similar repositories for WCA
Users that are interested in WCA are comparing it to the libraries listed below
Sorting:
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"☆58Sep 3, 2024Updated last year
- [ICML 2024 Spotlight] "Sample-specific Masks for Visual Reprogramming-based Prompting"☆12Dec 20, 2024Updated last year
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆12Feb 22, 2025Updated last year
- [ICML 2024] "Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training"☆17Jun 4, 2024Updated last year
- This repository is a collection of awesome things about vision prompts, including papers, code, etc.☆40Dec 22, 2023Updated 2 years ago
- Dataset for "Video Crowd Localization with Multi-focus Gaussian Neighborhood Attention and a Large-Scale Benchmark"☆35Dec 9, 2025Updated 3 months ago
- code for studying OpenAI's CLIP explainability☆38Jan 7, 2022Updated 4 years ago
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆45Jul 23, 2024Updated last year
- ☆12Jun 26, 2024Updated last year
- ☆10Mar 18, 2025Updated 11 months ago
- Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports☆41Jan 3, 2026Updated 2 months ago
- Implementation of the paper Unsupervised Learning of Video Representations using LSTMs☆10Nov 24, 2017Updated 8 years ago
- A machine-learning system to predict the location of Brachial Plexus nerve in ultrasound images of the human neck to aid in surgical prep…☆12Jun 5, 2017Updated 8 years ago
- A wrapper of the 'React-Toastify' library, for usage in Shiny.☆12Jul 31, 2021Updated 4 years ago
- Open set classification of car models. This 3-step classifier solves the problem where dogs are classified as cars, by first filtering th…☆12Feb 4, 2023Updated 3 years ago
- Repo for "Uncertain Multimodal Intention and Emotion Understanding in the Wild"☆16Oct 20, 2025Updated 4 months ago
- ☆18Dec 17, 2024Updated last year
- ☆12Mar 5, 2025Updated last year
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆20Sep 24, 2025Updated 5 months ago
- Official implementation of "MambaPainter: Neural Stroke-Based Rendering in a Single Step."☆14Sep 29, 2024Updated last year
- ☆10Nov 8, 2023Updated 2 years ago
- LoDoPaB-CT Grand Challenge Code☆14Dec 18, 2020Updated 5 years ago
- ☆199May 10, 2023Updated 2 years ago
- Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"☆11May 27, 2025Updated 9 months ago
- Official code for "Vision Transformers with Self-Distilled Registers" (NeurIPS 2025 Spotlight)☆32Dec 6, 2025Updated 3 months ago
- [ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"☆15Feb 15, 2025Updated last year
- ☆18Apr 7, 2025Updated 11 months ago
- ☆10Jun 13, 2023Updated 2 years ago
- [ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"☆14Feb 15, 2025Updated last year
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- CVPR2026☆25Sep 18, 2025Updated 5 months ago
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.☆49Nov 8, 2024Updated last year
- here are some classic networks for image classification implement by pytorch☆12Sep 4, 2020Updated 5 years ago
- Teaching Material for COMP90086 - Computer Vision☆15Oct 20, 2023Updated 2 years ago
- Code for BYOP [CVPR 2023]☆12Sep 25, 2023Updated 2 years ago
- [NeurIPS 2023] "Learning to Augment Distributions for Out-of-distribution Detection"☆11Nov 14, 2023Updated 2 years ago
- Comparing CNN+Softmax with CNN+SVM on CIFAR 10 Dataset☆14Jan 26, 2019Updated 7 years ago
- Guided patch-wise nonlocal SAR despeckling☆13Sep 2, 2021Updated 4 years ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆17Feb 22, 2025Updated last year