Hxyou/MSCLIP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Hxyou/MSCLIP)

Hxyou / MSCLIP

Official Code of ECCV 2022 paper MS-CLIP

☆91

Alternatives and similar repositories for MSCLIP

Users that are interested in MSCLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MikeWangWZHL / VidIL
View on GitHub
Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
☆117Sep 15, 2022Updated 3 years ago
FuchenUSTC / AherNet
View on GitHub
☆15Aug 25, 2020Updated 5 years ago
UMass-Embodied-AGI / genome
View on GitHub
☆16Apr 10, 2025Updated last year
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
starmemda / MlTr
View on GitHub
☆43Jun 15, 2021Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Hxyou / IdealGPT
View on GitHub
Official Code of IdealGPT
☆39Mar 3, 2026Updated 4 months ago
ucasligang / SimViT
View on GitHub
[ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.
☆67Oct 11, 2022Updated 3 years ago
facebookresearch / SLIP
View on GitHub
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆792Feb 9, 2023Updated 3 years ago
Chenglin-Yang / LESA_classification
View on GitHub
Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms
☆11Nov 29, 2021Updated 4 years ago
codezakh / LilT
View on GitHub
[ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning
☆40Jul 29, 2023Updated 2 years ago
mair-lab / mapl
View on GitHub
☆30May 27, 2023Updated 3 years ago
mshukor / eP-ALM
View on GitHub
[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.
☆27Oct 27, 2023Updated 2 years ago
megvii-research / protoclip
View on GitHub
📍 Official repository of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS 2023)
☆56Nov 8, 2023Updated 2 years ago
guilk / VLC
View on GitHub
Research code for "Training Vision-Language Transformers from Captions Alone"
☆33Jul 15, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
woojeongjin / FewVLM
View on GitHub
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)
☆42May 13, 2022Updated 4 years ago
alinlab / temporal-selfsupervision
View on GitHub
☆33Jul 28, 2022Updated 3 years ago
VITA-Group / Diverse-ViT
View on GitHub
[CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…
☆25Mar 9, 2022Updated 4 years ago
Sense-GVT / DeCLIP
View on GitHub
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
☆678Sep 19, 2022Updated 3 years ago
showlab / all-in-one
View on GitHub
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
☆281Mar 25, 2023Updated 3 years ago
PeixianChen / MEDet
View on GitHub
☆23Dec 23, 2022Updated 3 years ago
jonkahana / CLIPPR
View on GitHub
An official PyTorch implementation for CLIPPR
☆31Jul 22, 2023Updated 2 years ago
google-deepmind / svo_probes
View on GitHub
The SVO-Probes Dataset for Verb Understanding
☆29Jan 28, 2022Updated 4 years ago
facebookresearch / flip
View on GitHub
Official Open Source code for "Scaling Language-Image Pre-training via Masking"
☆428Mar 30, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
OpenGVLab / efficient-video-recognition
View on GitHub
☆184Aug 20, 2022Updated 3 years ago
microsoft / UniCL
View on GitHub
[CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"
☆410Nov 10, 2023Updated 2 years ago
HenryHZY / VL-PET
View on GitHub
[ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"
☆53Sep 21, 2023Updated 2 years ago
linhezheng19 / CAT
View on GitHub
Official implement of "CAT: Cross Attention in Vision Transformer".
☆169Jun 25, 2022Updated 4 years ago
mayug / 0-shot-llm-vision
View on GitHub
This repository contains the code for our CVPR 2024 paper,
☆16Aug 27, 2024Updated last year
ylsung / VL_adapter
View on GitHub
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
☆212Dec 18, 2022Updated 3 years ago
mshukor / ViCHA
View on GitHub
[BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"
☆54Oct 20, 2022Updated 3 years ago
wvangansbeke / Revisiting-Contrastive-SSL
View on GitHub
Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]
☆89Oct 2, 2021Updated 4 years ago
VALUE-Leaderboard / DataRelease
View on GitHub
Data Release for VALUE Benchmark
☆30Feb 16, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
modestyachts / ImageNetV2_pytorch
View on GitHub
ImageNetV2 Pytorch Dataset
☆44Apr 17, 2023Updated 3 years ago
amazon-science / video-contrastive-learning
View on GitHub
Video Contrastive Learning with Global Context, ICCVW 2021
☆162May 30, 2022Updated 4 years ago
TencentARC / TVTS
View on GitHub
Turning to Video for Transcript Sorting
☆49Aug 27, 2023Updated 2 years ago
FingerRec / OA-Transformer
View on GitHub
[CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》
☆61May 25, 2022Updated 4 years ago
sail-sg / ptp
View on GitHub
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
☆150Jun 7, 2023Updated 3 years ago
salesforce / ALPRO
View on GitHub
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
☆188May 1, 2025Updated last year
facebookresearch / viewseg
View on GitHub
Code for "Recognizing Scenes from Novel Viewpoints"
☆29Sep 16, 2022Updated 3 years ago