An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities
☆178Jul 27, 2022Updated 3 years ago
Alternatives and similar repositories for clip_playground
Users that are interested in clip_playground are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15May 23, 2022Updated 3 years ago
- ☆65Nov 4, 2021Updated 4 years ago
- CLIPort: What and Where Pathways for Robotic Manipulation☆541Nov 2, 2023Updated 2 years ago
- ☆61Jul 11, 2024Updated last year
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆14Oct 18, 2024Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆89Dec 3, 2021Updated 4 years ago
- General-purpose Visual Understanding Evaluation☆20Dec 21, 2023Updated 2 years ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Aug 30, 2021Updated 4 years ago
- [ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…☆903Aug 24, 2023Updated 2 years ago
- ☆13Dec 6, 2018Updated 7 years ago
- ☆21Mar 15, 2023Updated 3 years ago
- Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.☆50May 11, 2022Updated 3 years ago
- Code for reproducing the experiments on large-scale pre-training and transfer learning for the paper "Effect of large-scale pre-training …☆19May 29, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- CLOOB training (JAX) and inference (JAX and PyTorch)☆74May 16, 2022Updated 3 years ago
- Using LLMs and pre-trained caption models for super-human performance on image captioning.☆42Oct 13, 2023Updated 2 years ago
- Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)☆862Sep 30, 2021Updated 4 years ago
- [RA-L / ICRA 2022] UMPNet: Universal Manipulation Policy Network for Articulated Objects☆59Feb 16, 2022Updated 4 years ago
- Data repository for the VALSE benchmark.☆38Feb 15, 2024Updated 2 years ago
- SNARE Dataset with MATCH and LaGOR models☆23Mar 27, 2024Updated 2 years ago
- ☆17Dec 13, 2023Updated 2 years ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆18Jun 3, 2025Updated 9 months ago
- Easily compute clip embeddings and build a clip retrieval system with them☆2,736Updated this week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆195Dec 7, 2021Updated 4 years ago
- An easy way to start a python programming environment using GitHub Codespaces.☆15Sep 9, 2020Updated 5 years ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Mar 31, 2022Updated 4 years ago
- Repository for "Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search"☆179Sep 30, 2021Updated 4 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆211Dec 18, 2022Updated 3 years ago
- ☆12Mar 16, 2022Updated 4 years ago
- ☆14May 3, 2022Updated 3 years ago
- ☆87Apr 15, 2022Updated 3 years ago
- A simple library that implements CLIP guided loss in PyTorch.☆77Dec 25, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- RUDOLPH: One Hyper-Tasking Transformer can be creative as DALL-E and GPT-3 and smart as CLIP☆253Feb 6, 2023Updated 3 years ago
- ☆30Jan 17, 2022Updated 4 years ago
- ☆17Dec 16, 2022Updated 3 years ago
- ☆354May 10, 2022Updated 3 years ago
- ContextBLIP : Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions☆11May 17, 2024Updated last year
- When Dall E was a baby trained on a bit of data☆27Feb 26, 2021Updated 5 years ago
- This is the GPT2 baseline for ProtoQA☆12Jan 3, 2022Updated 4 years ago