YYJMJC / LOUPELinks

☆45

Alternatives and similar repositories for LOUPE

Users that are interested in LOUPE are comparing it to the libraries listed below

Sorting:

thunlp / PEVL
Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”
☆48Updated 3 years ago
sail-sg / ptp
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
☆151Updated 2 years ago
dhg-wei / MCL
(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
☆27Updated last year
Cuberick-Orion / CIRR
Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
☆124Updated last month
thunlp / CPT
Colorful Prompt Tuning for Pre-trained Vision-Language Models
☆49Updated 3 years ago
acmi-lab / CHILS
Code and results accompanying our paper titled CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets
☆58Updated 2 years ago
LeeYN-43 / Clover
Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)
☆40Updated 2 years ago
LightDXY / MaskCLIP
☆36Updated 2 years ago
yuhangzang / UPT
☆60Updated 6 months ago
chunmeifeng / SPRC
【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval
☆90Updated last year
allenai / reclip
☆88Updated 3 years ago
yuxiaochen1103 / FDT
☆62Updated 2 years ago
tonyhuang2022 / UPL
This repo is the official implementation of UPL (Unsupervised Prompt Learning for Vision-Language Models).
☆117Updated 3 years ago
SivanDoveh / TSVLC
Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models
☆47Updated 2 years ago
vishaal27 / SuS-X
Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]
☆105Updated 2 years ago
megvii-research / protoclip
📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)
☆53Updated 2 years ago
Monoxide-Chen / uncertainty_retrieval
ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization
☆74Updated last year
microsoft / UniTAB
UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)
☆89Updated 2 years ago
guozix / TaI-DPT
☆94Updated 2 years ago
ExplainableML / Vision_by_Language
[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
☆80Updated last year
IIGROUP / SCL
Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning
☆20Updated last year
pals-ttic / adapting-CLIP
☆65Updated 2 years ago
miccunifi / CIRCO
[ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset
☆78Updated 3 months ago
lezhang7 / Enhance-FineGrained
[CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding
☆53Updated 7 months ago
joeyz0z / MeaCap
(CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning
☆53Updated last year
deepglint / ALIP
[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
☆102Updated 2 years ago
liunian-harold-li / DesCo
☆30Updated last year
amazon-science / mix-generation
MixGen: A New Multi-Modal Data Augmentation
☆126Updated 2 years ago
facebookresearch / genecis
Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"
☆61Updated 2 years ago
joeyz0z / ConZIC
Official implementation of "ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing"
☆74Updated 2 years ago