CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet
☆222Dec 16, 2022Updated 3 years ago
Alternatives and similar repositories for FT-CLIP
Users that are interested in FT-CLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago
- Robust fine-tuning of zero-shot models☆763Apr 29, 2022Updated 4 years ago
- ☆579Jul 19, 2022Updated 3 years ago
- ☆678Nov 28, 2023Updated 2 years ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆379Jun 1, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆97Nov 2, 2022Updated 3 years ago
- [NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"☆289Jan 14, 2024Updated 2 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,041Sep 29, 2022Updated 3 years ago
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".☆78Jan 18, 2023Updated 3 years ago
- ☆203May 10, 2023Updated 3 years ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆321Jun 3, 2024Updated 2 years ago
- EVA Series: Visual Representation Fantasies from BAAI☆2,680Aug 1, 2024Updated last year
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm☆676Sep 19, 2022Updated 3 years ago
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Exploring Visual Prompts for Adapting Large-Scale Models☆290Jun 6, 2022Updated 4 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆343Oct 21, 2023Updated 2 years ago
- Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"☆258May 3, 2024Updated 2 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆792Feb 9, 2023Updated 3 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆141Dec 16, 2025Updated 6 months ago
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆214Feb 27, 2024Updated 2 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆53Jun 12, 2023Updated 3 years ago
- Cross-modal few-shot adaptation with CLIP☆353Apr 29, 2025Updated last year
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Sep 5, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An open source implementation of CLIP.☆13,939Updated this week
- Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral)☆479Sep 19, 2022Updated 3 years ago
- BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training☆399Oct 23, 2024Updated last year
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"☆387Sep 16, 2022Updated 3 years ago
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning☆297Mar 13, 2024Updated 2 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆529Mar 14, 2023Updated 3 years ago
- Replication of Pix2Seq with Pretrained Model☆58Nov 6, 2021Updated 4 years ago
- ☆76Sep 30, 2022Updated 3 years ago
- An official PyTorch implementation for CLIPPR☆31Jul 22, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- SVIT: Scaling up Visual Instruction Tuning☆168Jun 20, 2024Updated 2 years ago
- Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)☆2,212May 20, 2024Updated 2 years ago
- PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"☆99Jun 28, 2023Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,348Oct 5, 2023Updated 2 years ago
- Grounded Language-Image Pre-training☆2,604Jan 24, 2024Updated 2 years ago
- [CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"☆410Nov 10, 2023Updated 2 years ago