📍 Official repository of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS 2023)
☆56Nov 8, 2023Updated 2 years ago
Alternatives and similar repositories for protoclip
Users that are interested in protoclip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🦩 Official repository of paper "Visual Instruction Tuning with Polite Flamingo" (AAAI-24 Oral)☆65Dec 9, 2023Updated 2 years ago
- [中国图象图形学报&ChinaMM2025] 非空间配准多模态目标检测决策融合策略☆40Jul 16, 2025Updated 11 months ago
- ☆11Oct 27, 2019Updated 6 years ago
- 🎁 A Large-scale Multi-modal E-Commerce Products Dataset (LTDL@IJCAI-21 Best Dataset & Pattern Recognition 2023)☆42Dec 30, 2023Updated 2 years ago
- 🚀 Codebase and Fondation Models for Visual Instruction Tuning☆14Aug 19, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Toolkit for Elevater Benchmark☆77Oct 17, 2023Updated 2 years ago
- ☆125Feb 21, 2023Updated 3 years ago
- ☆22Apr 27, 2024Updated 2 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆53Jun 12, 2023Updated 3 years ago
- A codebase for flexible and efficient Image Text Representation Alignment☆24Jun 20, 2023Updated 2 years ago
- [CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》☆150Jun 7, 2023Updated 3 years ago
- 🎶 Music-Driven Conducting Motion Generation (IEEE ICME'21 Best Demo)☆98Apr 7, 2023Updated 3 years ago
- A simple pytorch implementation of baseline based-on CLIP for Image-text Matching.☆19May 25, 2023Updated 3 years ago
- ☆110Dec 7, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models☆21Jan 11, 2024Updated 2 years ago
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”☆31May 1, 2023Updated 3 years ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆20Oct 17, 2024Updated last year
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…☆84Aug 16, 2022Updated 3 years ago
- vit for few-shot classification☆48Mar 24, 2023Updated 3 years ago
- [CVPR2024] Simple Semantic-Aided Few-Shot Learning☆62Sep 1, 2024Updated last year
- Few shot recognition using CLIP's OpenAI architecture.☆36Aug 2, 2021Updated 4 years ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆549Sep 15, 2023Updated 2 years ago
- Official implementation of TagAlign☆37Dec 11, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆37Feb 1, 2022Updated 4 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆91Jul 27, 2022Updated 3 years ago
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning☆170Jul 15, 2023Updated 2 years ago
- Using LLMs and pre-trained caption models for super-human performance on image captioning.☆42Oct 13, 2023Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- ☆28Oct 18, 2022Updated 3 years ago
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"☆15Oct 12, 2023Updated 2 years ago
- Compress conventional Vision-Language Pre-training data☆52Sep 22, 2023Updated 2 years ago
- ☆13Apr 13, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Unified Framework for Video-Language Understanding☆62Jun 17, 2023Updated 3 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022☆271Oct 2, 2024Updated last year
- Code repository for the paper - "Neural Priming for Sample-Efficient Adaptation"☆14Nov 13, 2023Updated 2 years ago
- Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"☆80May 5, 2024Updated 2 years ago
- ☆27Jan 17, 2026Updated 5 months ago
- The official repo for the DanQing dataset.☆36Mar 25, 2026Updated 2 months ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆19Jun 3, 2025Updated last year