BestiVictory / DPC-Captions
An image caption dataset of images from www.dpchallenge.com.
☆17Updated 5 years ago
Alternatives and similar repositories for DPC-Captions
Users who are interested in DPC-Captions are comparing it to the libraries listed below
- Code and data for Aesthetic Image Captioning from Weakly-Labelled Photographs☆34Updated 6 years ago
- Code for the paper "Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment"☆96Updated 2 years ago
- ☆29Updated 4 years ago
- Aesthetic Critiques Generation for Photos (ICCV17)☆20Updated 3 years ago
- Code for ICPR paper☆22Updated 3 years ago
- ☆34Updated 2 years ago
- ☆48Updated 4 years ago
- EVA: An Explainable Visual Aesthetics Dataset☆26Updated 3 years ago
- ☆17Updated 5 years ago
- [ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP☆38Updated 3 years ago
- Code for the Video Similarity Challenge.☆80Updated last year
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Updated 3 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆90Updated 3 years ago
- Official code of "StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis" (CVPR 2022)☆43Updated 3 years ago
- L-Verse: Bidirectional Generation Between Image and Text☆109Updated 6 months ago
- The official implementation of MP_{ada} in Attention-based Multi-patch Aggregation for Image Aesthetic Assessment (MM 2018)☆84Updated 4 years ago
- A list of Image Aesthetics papers and resources.☆74Updated 5 years ago
- Implementation of our PR 2020 paper: Unsupervised Text-to-Image Synthesis☆13Updated 5 years ago
- ☆20Updated 4 years ago
- FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions☆55Updated last year
- IAE Dataset, produced by Chaoran Cui, Zhen Shen, Jun Yu. A large scale dataset to facilitate multi-task learning for unified image aesthet…☆19Updated 4 years ago
- TensorFlow implementation of the paper [CVPR 2020] Image Search with Text Feedback by Visiolinguistic Attention Learning☆63Updated 5 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 4 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆81Updated 2 years ago
- Evaluation benchmark for the task of Semantic Image Translation. Contains code to run FlexIT (CVPR 2022)☆34Updated 3 years ago
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"☆87Updated 2 years ago
- ☆53Updated 3 years ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.☆85Updated 2 years ago
- Use CLIP to represent video for Retrieval Task☆70Updated 4 years ago
- ☆26Updated 4 years ago