filipbasara0 / simple-clip
A minimal but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch
☆27 Updated 11 months ago
Alternatives and similar repositories for simple-clip:
Users who are interested in simple-clip are comparing it to the repositories listed below:
- Official repo for the paper "EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata" ☆44 Updated last year
- [ICCV 2023] Zero-shot image editing with stochastic diffusion models ☆50 Updated last year
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models ☆36 Updated 9 months ago
- ☆52 Updated last year
- ☆22 Updated 3 months ago
- ICCV2023-Diffusion-Papers ☆109 Updated last year
- Fixed official code for the paper "A Closer Look at Parameter-Efficient Tuning in Diffusion Models" ☆40 Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training ☆68 Updated last year
- Code for the paper "Do text-free diffusion models learn discriminative visual representations?" ☆21 Updated last year
- Perceptual Artifacts Localization for Image Synthesis Tasks (ICCV '23) ☆51 Updated last year
- [NeurIPS-2022] Annual Conference on Neural Information Processing Systems ☆18 Updated last year
- Code for the paper "Unsegment Anything by Simulating Deformation" (CVPR 2024) ☆25 Updated 7 months ago
- The official GitHub repo for "Test-Time Training with Masked Autoencoders" ☆80 Updated last year
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training ☆15 Updated last year
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning… ☆23 Updated 5 months ago
- ☆47 Updated 9 months ago
- [CVPR 2024 Highlight] ImageNet-D ☆40 Updated 3 months ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model" ☆48 Updated 2 years ago
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023) ☆63 Updated last year
- PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis (CVPR 2024 Highlight) ☆36 Updated 10 months ago
- ☆44 Updated last year
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection" ☆27 Updated last year
- [ECCV '24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models ☆20 Updated 2 months ago
- ☆21 Updated last month
- ☆29 Updated last year
- ☆39 Updated last year
- [ICLR '24] MaGIC: Multi-modality Guided Image Completion ☆47 Updated 8 months ago
- ☆52 Updated 9 months ago