kevinzakka / clip_playground
An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities
☆178 · Jul 27, 2022 · Updated 3 years ago
Alternatives and similar repositories for clip_playground
Users who are interested in clip_playground are comparing it to the libraries listed below.
- Contrastive Language-Image Pretraining ☆144 · Sep 6, 2022 · Updated 3 years ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM ☆60 · Aug 30, 2021 · Updated 4 years ago
- ☆64 · Nov 4, 2021 · Updated 4 years ago
- Official code for the paper: "Metadata Archaeology" ☆19 · May 10, 2023 · Updated 2 years ago
- CLIPort: What and Where Pathways for Robotic Manipulation ☆540 · Nov 2, 2023 · Updated 2 years ago
- Code for reproducing the experiments on large-scale pre-training and transfer learning for the paper "Effect of large-scale pre-training … ☆19 · May 29, 2022 · Updated 3 years ago
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models" ☆14 · Oct 18, 2024 · Updated last year
- ☆21 · Mar 15, 2023 · Updated 2 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch) ☆74 · May 16, 2022 · Updated 3 years ago
- When Dall E was a baby trained on a bit of data ☆27 · Feb 26, 2021 · Updated 4 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch ☆88 · Dec 3, 2021 · Updated 4 years ago
- codebase for the SIMAT dataset and evaluation ☆38 · Feb 16, 2022 · Updated 4 years ago
- ☆12 · Mar 16, 2022 · Updated 3 years ago
- Styled text-to-drawing synthesis method. Featured at IJCAI 2022 and the 2021 NeurIPS Workshop on Machine Learning for Creativity and Desi… ☆283 · Nov 15, 2022 · Updated 3 years ago
- RUDOLPH: One Hyper-Tasking Transformer can be creative as DALL-E and GPT-3 and smart as CLIP ☆254 · Feb 6, 2023 · Updated 3 years ago
- ☆14 · May 3, 2022 · Updated 3 years ago
- [ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode… ☆900 · Aug 24, 2023 · Updated 2 years ago
- Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043) ☆861 · Sep 30, 2021 · Updated 4 years ago
- [RA-L / ICRA 2022] UMPNet: Universal Manipulation Policy Network for Articulated Objects ☆59 · Feb 16, 2022 · Updated 4 years ago
- Google Colab notebooks ☆43 · Sep 9, 2024 · Updated last year
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa. ☆60 · Mar 31, 2022 · Updated 3 years ago
- ☆13 · Dec 6, 2018 · Updated 7 years ago
- Neural networks do line art stylization ☆14 · Dec 30, 2020 · Updated 5 years ago
- ☆15 · May 23, 2022 · Updated 3 years ago
- ☆195 · Dec 7, 2021 · Updated 4 years ago
- ☆28 · Dec 16, 2021 · Updated 4 years ago
- ☆30 · Jan 17, 2022 · Updated 4 years ago
- A simple library that implements CLIP guided loss in PyTorch. ☆77 · Dec 25, 2021 · Updated 4 years ago
- ☆354 · May 10, 2022 · Updated 3 years ago
- General-purpose Visual Understanding Evaluation ☆20 · Dec 21, 2023 · Updated 2 years ago
- ☆17 · Dec 13, 2023 · Updated 2 years ago
- L-Verse: Bidirectional Generation Between Image and Text ☆107 · Apr 1, 2025 · Updated 10 months ago
- Linear image-to-image translation ☆41 · Jul 31, 2020 · Updated 5 years ago
- Repository for "Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search" ☆179 · Sep 30, 2021 · Updated 4 years ago
- [ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383 ☆421 · Oct 28, 2022 · Updated 3 years ago
- https://www.kaggle.com/c/rsna-intracranial-hemorrhage-detection/ ☆19 · Oct 20, 2019 · Updated 6 years ago
- Modified fork of Xuebin Qin's U-2-Net Repository. Used for demonstration purposes. ☆17 · Jun 30, 2021 · Updated 4 years ago
- [CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering ☆20 · Sep 21, 2024 · Updated last year
- Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER ☆21 · Jul 19, 2023 · Updated 2 years ago