filipbasara0 / simple-clip
A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch
☆27Updated last year
Alternatives and similar repositories for simple-clip:
Users that are interested in simple-clip are comparing it to the libraries listed below
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆16Updated 2 months ago
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models☆36Updated last year
- official repo for the paper "EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata"☆45Updated last year
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆27Updated 10 months ago
- ☆52Updated 2 years ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆44Updated this week
- Official implementation of LaVin-DiT☆26Updated 2 months ago
- [NeurIPS-2022] Annual Conference on Neural Information Processing Systems☆18Updated last year
- The official github repo for "Test-Time Training with Masked Autoencoders"☆80Updated last year
- [NeurIPS 2024] SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow☆26Updated 4 months ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆48Updated 2 years ago
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning…☆26Updated 3 weeks ago
- Code of "What Images are More Memorable to Machines?"☆15Updated 2 years ago
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models☆46Updated last year
- ☆43Updated last year
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆22Updated 4 months ago
- ☆23Updated 5 months ago
- [CVPR 2024 Highlight] ImageNet-D☆41Updated 5 months ago
- Augmenting with Language-guided Image Augmentation (ALIA)☆75Updated last year
- [CVPR'24] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLAP)…☆69Updated 10 months ago
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders☆35Updated last year
- ☆14Updated last year
- [TIP] Exploring Effective Factors for Improving Visual In-Context Learning☆19Updated 5 months ago
- ☆42Updated last year
- Pytorch implementation of Mix-Shifting-MLP (MS-MLP)☆16Updated 3 years ago
- ☆19Updated last year
- [ICCV 2023] Zero-shot image editing with stochastic diffusion models☆55Updated last year
- ☆20Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆74Updated last year
- i-mae Pytorch Repo☆20Updated 11 months ago