KeremTurgutlu / clip_art
CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design
☆27 Updated 2 years ago
Alternatives and similar repositories for clip_art:
Users interested in clip_art are comparing it to the libraries listed below.
- ☆46 Updated 3 years ago
- ☆50 Updated 2 years ago
- A non-JIT implementation / replication of OpenAI's CLIP in PyTorch ☆34 Updated 4 years ago
- Use CLIP to represent video for Retrieval Task ☆69 Updated 3 years ago
- ☆34 Updated last year
- [NeurIPS'22] ReCo: Retrieve and Co-segment for Zero-shot Transfer ☆61 Updated last year
- ☆98 Updated 2 months ago
- Command-line tool for downloading and extending the RedCaps dataset. ☆46 Updated last year
- ☆26 Updated 3 years ago
- L-Verse: Bidirectional Generation Between Image and Text ☆108 Updated 2 years ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data. ☆84 Updated last year
- ☆47 Updated 3 years ago
- ☆31 Updated 3 years ago
- A large-scale dataset for instance-level recognition for artworks is introduced. ☆47 Updated last year
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images ☆58 Updated 3 years ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision ☆92 Updated 3 years ago
- PyTorch code for MUST ☆106 Updated last year
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990) ☆58 Updated 3 years ago
- RareAct: A video dataset of unusual interactions ☆32 Updated 4 years ago
- TensorFlow implementation of the paper [CVPR 2020] Image Search with Text Feedback by Visiolinguistic Attention Learning ☆63 Updated 4 years ago
- ☆62 Updated 3 years ago
- Localized Narratives ☆82 Updated 3 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs. ☆94 Updated last year
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021] ☆88 Updated 3 years ago
- ☆69 Updated last year
- ☆73 Updated 2 years ago
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021. ☆66 Updated 2 years ago
- [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast p… ☆128 Updated 2 years ago
- Implementation of our PR 2020 paper: Unsupervised Text-to-Image Synthesis ☆13 Updated 4 years ago
- An image caption dataset of images from www.dpchallenge.com. ☆12 Updated 5 years ago