apple / ml-tic-clip
Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".
☆95Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for ml-tic-clip
- Object Recognition as Next Token Prediction (CVPR 2024 Highlight)☆161Updated 2 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆112Updated 3 months ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆97Updated 2 months ago
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆109Updated 7 months ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆98Updated 6 months ago
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation☆36Updated last month
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆86Updated last month
- ☆58Updated 8 months ago
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.☆88Updated 8 months ago
- ☆33Updated 4 months ago
- Code release for "Improved baselines for vision-language pre-training"☆57Updated 6 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆34Updated 2 weeks ago
- This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"☆121Updated 5 months ago
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or…☆107Updated 4 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆49Updated 2 months ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆24Updated 5 months ago
- ☆70Updated 10 months ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆37Updated last year
- ☆30Updated 9 months ago
- A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei…☆78Updated last year
- Official repository for the General Robust Image Task (GRIT) Benchmark☆50Updated last year
- ☆32Updated this week
- [CVPR 2023 Highlight] Beyond mAP: Towards better evaluation of instance segmentation☆26Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆34Updated last year
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…☆57Updated 2 months ago
- [CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings☆44Updated last year
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation☆112Updated last month
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆81Updated 4 months ago
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models☆70Updated 2 months ago
- The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"☆229Updated 3 months ago