An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities
☆178Jul 27, 2022Updated 3 years ago
Alternatives and similar repositories for clip_playground
Users that are interested in clip_playground are comparing it to the libraries listed below.
- Contrastive Language-Image Pretraining☆146Sep 6, 2022Updated 3 years ago
- ☆65Nov 4, 2021Updated 4 years ago
- CLIPort: What and Where Pathways for Robotic Manipulation☆544Nov 2, 2023Updated 2 years ago
- ☆61Jul 11, 2024Updated last year
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆14Oct 18, 2024Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆89Dec 3, 2021Updated 4 years ago
- General-purpose Visual Understanding Evaluation☆20Dec 21, 2023Updated 2 years ago
- [ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…☆906Aug 24, 2023Updated 2 years ago
- ☆13Dec 6, 2018Updated 7 years ago
- ☆21Mar 15, 2023Updated 3 years ago
- Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.☆50May 11, 2022Updated 3 years ago
- Code for reproducing the experiments on large-scale pre-training and transfer learning for the paper "Effect of large-scale pre-training …☆19May 29, 2022Updated 3 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆74May 16, 2022Updated 3 years ago
- Using LLMs and pre-trained caption models for super-human performance on image captioning.☆42Oct 13, 2023Updated 2 years ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 2 years ago
- [RA-L / ICRA 2022] UMPNet: Universal Manipulation Policy Network for Articulated Objects☆59Feb 16, 2022Updated 4 years ago
- Data repository for the VALSE benchmark.☆38Feb 15, 2024Updated 2 years ago
- [NeurIPS 24] A new training and evaluation framework for learning interpretable deep vision models and benchmarking different interpretab…☆31Jun 5, 2025Updated 10 months ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆18Jun 3, 2025Updated 10 months ago
- Easily compute clip embeddings and build a clip retrieval system with them☆2,749Mar 28, 2026Updated 3 weeks ago
- ☆195Dec 7, 2021Updated 4 years ago
- Repository for "Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search"☆179Sep 30, 2021Updated 4 years ago
- ☆12Mar 16, 2022Updated 4 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆211Dec 18, 2022Updated 3 years ago
- ☆14May 3, 2022Updated 3 years ago
- ☆91Apr 15, 2022Updated 4 years ago
- A simple library that implements CLIP guided loss in PyTorch.☆77Dec 25, 2021Updated 4 years ago
- Styled text-to-drawing synthesis method. Featured at IJCAI 2022 and the 2021 NeurIPS Workshop on Machine Learning for Creativity and Desi…☆282Nov 15, 2022Updated 3 years ago
- RUDOLPH: One Hyper-Tasking Transformer can be creative as DALL-E and GPT-3 and smart as CLIP☆253Feb 6, 2023Updated 3 years ago
- ☆30Jan 17, 2022Updated 4 years ago
- ☆17Dec 16, 2022Updated 3 years ago
- ☆354May 10, 2022Updated 3 years ago
- ContextBLIP : Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions☆11May 17, 2024Updated last year
- When Dall E was a baby trained on a bit of data☆27Feb 26, 2021Updated 5 years ago
- This is the GPT2 baseline for ProtoQA☆12Jan 3, 2022Updated 4 years ago
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)☆246Jun 10, 2025Updated 10 months ago
- [ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"☆32Jan 26, 2026Updated 2 months ago
- Majesty Diffusion by @Dango233(@Dango233max) and @apolinario (@multimodalart)☆276Jul 25, 2022Updated 3 years ago