The most impactful papers related to contrastive pretraining for multimodal models!
☆77 · Mar 5, 2024 · Updated 2 years ago
Alternatives and similar repositories for awesome-clip-papers
Users who are interested in awesome-clip-papers compare it to the repositories listed below.
- Semantically Search Emojis From the Command Line! ☆13 · Nov 26, 2023 · Updated 2 years ago
- Albumentations Data Augmentation Plugin for FiftyOne! ☆14 · Aug 22, 2024 · Updated last year
- [NeurIPS 2024] WATT: Weight Average Test-Time Adaptation of CLIP ☆56 · Sep 26, 2024 · Updated last year
- Testbed for multimodal retrieval augmented generation techniques with FiftyOne, LlamaIndex, and Milvus ☆21 · Aug 9, 2024 · Updated last year
- ☆40 · Apr 8, 2024 · Updated last year
- Perform visual question answering on your images ☆19 · May 8, 2024 · Updated last year
- [NeurIPS24] VisMin: Visual Minimal-Change Understanding ☆19 · Mar 3, 2025 · Updated last year
- My journey during 10 weeks of building FiftyOne plugins ☆22 · Nov 12, 2023 · Updated 2 years ago
- ☆54 · Jun 29, 2025 · Updated 8 months ago
- [ACM MM '24 Poster] Official repository of paper titled "Towards Robustness Prompt Tuning with Fully Test-Time Adaptation for CLIP’s Zero… ☆10 · Aug 6, 2024 · Updated last year
- WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario (COLING 2025) ☆12 · Jan 5, 2025 · Updated last year
- [CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec… ☆67 · Apr 4, 2025 · Updated 11 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models ☆92 · Jul 4, 2024 · Updated last year
- Code for the ICCV 2025 paper "IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves" ☆17 · Jul 11, 2025 · Updated 8 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24] ☆62 · Dec 10, 2024 · Updated last year
- An Open Dataset for Wireless Cellular Spectrum Monitoring and Anomaly Detection ☆16 · Updated this week