cardinalblue / clip-models-for-distillationLinks

☆19

Alternatives and similar repositories for clip-models-for-distillation

Users that are interested in clip-models-for-distillation are comparing it to the libraries listed below

Sorting:

Deferf / CLIP_Video_Representation
Use CLIP to represent video for Retrieval Task
☆70Updated 4 years ago
guilk / VLC
Research code for "Training Vision-Language Transformers from Captions Alone"
☆34Updated 3 years ago
01BB01 / eBayChallenge
[FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.
☆26Updated 2 years ago
VideoNetworks / TokShift-Transformer
☆72Updated last year
YuanEZhou / satic
☆26Updated 4 years ago
papermsucode / mdmmt
MDMMT: Multidomain Multimodal Transformer for Video Retrieval
☆26Updated 4 years ago
princetonvisualai / SPICE-U
☆11Updated 4 years ago
hq03 / FoodLogoDet-1500-Dataset
☆28Updated 3 years ago
microsoft / GEM
☆24Updated 4 years ago
ecom-research / ComposeAE
Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
☆57Updated 3 years ago
intersun / LightningDOT
source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT
☆72Updated 2 years ago
Zasder3 / train-CLIP-FT
☆46Updated 3 years ago
YehLi / TDEN
☆9Updated 2 years ago
facebookresearch / connect-caption-and-trace
A unified framework to jointly model images, text, and human attention traces.
☆78Updated 4 years ago
zerovl / ZeroVL
[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources
☆45Updated 2 years ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18Updated 3 years ago
Cadene / vqa-maskrcnn-benchmark
☆20Updated 3 years ago
weiyx16 / CLIP-pytorch
A non-JIT version implementation / replication of CLIP of OpenAI in pytorch
☆34Updated 4 years ago
xuewyang / Fashion_Captioning
ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.
☆85Updated 2 years ago
josiahwang / phraseloceval
Phrase Localization Evaluation Toolkit
☆20Updated 5 years ago
seungkee / 2nd-place-solution-to-Facebook-Image-Similarity-Matching-Track
☆29Updated 3 years ago
allenai / gpv2
☆32Updated 3 years ago
facebookresearch / vsc2022
Code for the Video Similarity Challenge.
☆81Updated last year
yanbeic / VAL
Tensorflow Implementation on Paper [CVPR2020]Image Search with Text Feedback by Visiolinguistic Attention Learning
☆63Updated 4 years ago
ucasligang / SimViT
[ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.
☆68Updated 2 years ago
CupidJay / MoCov3-pytorch
custom pytorch implementation of MoCo v3
☆46Updated 4 years ago
allenai / gpv-1
A task-agnostic vision-language architecture as a step towards General Purpose Vision
☆92Updated 4 years ago
MILVLG / rosita
ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
☆56Updated 2 years ago
medhini / clip_it
CLIP-It! Language-Guided Video Summarization
☆74Updated 4 years ago
lucidrains / omninet-pytorch
Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch
☆58Updated 4 years ago