BestiVictory / DPC-CaptionsLinks
A image caption dataset about images from www.dpchallenge.com.
☆16Updated 5 years ago
Alternatives and similar repositories for DPC-Captions
Users that are interested in DPC-Captions are comparing it to the libraries listed below
Sorting:
- Code and data for Aesthetic Image Captioning from Weakly-Labelled Photographs☆33Updated 5 years ago
- Code for the paper "Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment"☆94Updated 2 years ago
- ☆29Updated 4 years ago
- Code for ICPR paper☆22Updated 3 years ago
- Aesthetic Critiques Generation for Photos (ICCV17)☆20Updated 2 years ago
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Updated 3 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆90Updated 3 years ago
- ☆46Updated 4 years ago
- ☆34Updated 2 years ago
- CVPR2023 paper☆52Updated last year
- Tensorflow Implementation on Paper [CVPR2020]Image Search with Text Feedback by Visiolinguistic Attention Learning☆63Updated 4 years ago
- Use CLIP to represent video for Retrieval Task☆70Updated 4 years ago
- CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design☆27Updated 3 years ago
- ☆26Updated 4 years ago
- EVA: An Explainable Visual Aesthetics Dataset☆26Updated 2 years ago
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆66Updated 2 years ago
- Official code of "StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis" (CVPR 2022)☆42Updated 3 years ago
- A list of Image Aesthetics papers and resources.☆74Updated 5 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 4 years ago
- [ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP☆38Updated 3 years ago
- Implementation of our PR 2020 paper:Unsupervised Text-to-Image Synthesis☆13Updated 5 years ago
- ☆17Updated 5 years ago
- Code for the Video Similarity Challenge.☆81Updated last year
- AQUA dataset and VIKING model for the task of Art Visual Question Answering☆26Updated 4 years ago
- [ECCV 2022] FashionViL: Fashion-Focused V+L Representation Learning☆61Updated 2 years ago
- ☆31Updated 3 years ago
- ☆20Updated 4 years ago
- Modality-Agnostic Attention Fusion for visual search with text feedback☆25Updated 2 years ago
- Dual-Branch Network for Portrait Image Quality Assessment☆16Updated last year
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 3 years ago