megvii-research / protoclipView external linksLinks
π Official repository of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS 2023)
β55Nov 8, 2023Updated 2 years ago
Alternatives and similar repositories for protoclip
Users that are interested in protoclip are comparing it to the libraries listed below
Sorting:
- 𦩠Official repository of paper "Visual Instruction Tuning with Polite Flamingo" (AAAI-24 Oral)β65Dec 9, 2023Updated 2 years ago
- β124Feb 21, 2023Updated 2 years ago
- β20Apr 23, 2024Updated last year
- Toolkit for Elevater Benchmarkβ76Oct 17, 2023Updated 2 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222β53Jun 12, 2023Updated 2 years ago
- β22Apr 27, 2024Updated last year
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604β¦β84Aug 16, 2022Updated 3 years ago
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Modelsβ21Jan 11, 2024Updated 2 years ago
- Using LLMs and pre-trained caption models for super-human performance on image captioning.β42Oct 13, 2023Updated 2 years ago
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"β13Jun 11, 2023Updated 2 years ago
- β10Jul 5, 2024Updated last year
- β11Oct 27, 2019Updated 6 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Dataβ14Sep 30, 2023Updated 2 years ago
- Project for SNARE benchmarkβ11Jun 5, 2024Updated last year
- β29Oct 18, 2022Updated 3 years ago
- vit for few-shot classificationβ47Mar 24, 2023Updated 2 years ago
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuningβ167Jul 15, 2023Updated 2 years ago
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understandingβ55Apr 7, 2025Updated 10 months ago
- [CVPR2024] Simple Semantic-Aided Few-Shot Learningβ54Sep 1, 2024Updated last year
- The official repo for the DanQing dataset.β29Jan 16, 2026Updated last month
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!β11May 24, 2023Updated 2 years ago
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"β15Oct 12, 2023Updated 2 years ago
- This is the repository for "SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Recognition"β16Oct 8, 2024Updated last year
- β50Oct 29, 2023Updated 2 years ago
- Masking Strategies for Background Bias Removal in Computer Vision Models (ICCVW OODCV 2023 paper)β16Jul 3, 2025Updated 7 months ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.β56Mar 6, 2023Updated 2 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022β268Oct 2, 2024Updated last year
- Official Code of ECCV 2022 paper MS-CLIPβ91Jul 27, 2022Updated 3 years ago
- Code for AutoGeo.β16Aug 18, 2024Updated last year
- Code repository for the paper - "Neural Priming for Sample-Efficient Adaptation"β14Nov 13, 2023Updated 2 years ago
- State of What Art? A Call for Multi-Prompt LLM Evaluationβ15Jul 10, 2024Updated last year
- A simple pytorch implementation of baseline based-on CLIP for Image-text Matching.β18May 25, 2023Updated 2 years ago
- Official implementation of TagAlignβ35Dec 11, 2024Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?β35Apr 27, 2023Updated 2 years ago
- Code for EMNLP 2022 paper βDistilled Dual-Encoder Model for Vision-Language Understandingββ31May 1, 2023Updated 2 years ago
- β105Dec 7, 2023Updated 2 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Trainingβ141Dec 16, 2025Updated 2 months ago
- A Unified Framework for Video-Language Understandingβ61Jun 17, 2023Updated 2 years ago
- Codebase for adaptive continual memoryβ13Aug 15, 2023Updated 2 years ago