Implementation of the "Learn No to Say Yes Better" paper.
☆40Apr 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for CoN-CLIP
Users that are interested in CoN-CLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code repository for "Post-pre-training for Modality Alignment in Vision-Language Foundation Models" (CVPR2025)☆39Jul 25, 2025Updated 8 months ago
- ☆25Jul 10, 2023Updated 2 years ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆53Jun 12, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- Papers from the intersection of surgery and data science / machine learning☆15Jan 28, 2024Updated 2 years ago
- Learning by Aligning Videos in Time (CVPR 2021)☆14Sep 10, 2023Updated 2 years ago
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align…☆19Apr 5, 2024Updated 2 years ago
- Official code of the paper "EgoExOR: EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding" accepted at …☆26Feb 20, 2026Updated 2 months ago
- Latest Advances on Modality Priors in Multimodal Large Language Models☆31Dec 10, 2025Updated 4 months ago
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models☆47Sep 25, 2023Updated 2 years ago
- AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)☆63Mar 1, 2025Updated last year
- ☆40Apr 8, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Aug 14, 2022Updated 3 years ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆47Dec 1, 2024Updated last year
- [CVPR 2025] Offical implementation of the paper "Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters The…☆32Mar 12, 2026Updated last month
- Sets of Image Provenance cases, including node and edge information, generated automatically using Reddit Photoshop Battles☆14Jul 26, 2018Updated 7 years ago
- [NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training☆224Mar 20, 2025Updated last year
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆35Sep 9, 2024Updated last year
- Repo for "Synergy of Sight and Semantics: Visual Intention Understanding with CLIP"☆12Mar 12, 2025Updated last year
- Distributed Optimization Infra for learning CLIP models☆29Oct 3, 2024Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆28Jul 18, 2025Updated 9 months ago
- Smooth Variational Graph Embeddings for Efficient Neural Architecture Search☆14Feb 2, 2023Updated 3 years ago
- The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detect…☆32Jun 9, 2025Updated 10 months ago
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Oct 13, 2022Updated 3 years ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆38Aug 18, 2024Updated last year
- This is the official repository of our NeurIPS 2025 paper "MaxSup: Overcoming Representation Collapse in Label Smoothing"☆22Nov 6, 2025Updated 5 months ago
- This is the official implementation of our ACL 2025 Main paper "Balancing Diversity and Risk in LLM Sampling".☆17Oct 16, 2025Updated 6 months ago
- VisualGPTScore for visio-linguistic reasoning☆27Oct 7, 2023Updated 2 years ago
- NegCLIP.☆40Feb 6, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [WACV 2026] An extremely simple method for validation-free efficient adaptation of CLIP-like VLMs that is robust to the learning rate.☆33Apr 17, 2025Updated last year
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆85Jul 4, 2024Updated last year
- Smooth Variational Graph Embeddings for Efficient Neural Architecture Search☆15Apr 8, 2024Updated 2 years ago
- ☆22Sep 16, 2025Updated 7 months ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…☆61Jul 8, 2023Updated 2 years ago
- This is the official implementation of our BMVC 2022 paper "SP-ViT: Learning 2D Spatial Priors for Vision Transformers"☆13Mar 27, 2023Updated 3 years ago
- Open-source strong baseline for domain generlization re-ID. We will udpate the strong baseline and CFD method~☆10Nov 30, 2021Updated 4 years ago