JosephPai / Art-DescriptionLinks

Code for paper <Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation> in ICCV 2021.

☆13

Alternatives and similar repositories for Art-Description

Users that are interested in Art-Description are comparing it to the libraries listed below

Sorting:

noagarcia / explain-paintings
Repository for the data in the paper "Explain Me the Painting: Multi-TopicKnowledgeable Art Description Generation".
☆20Updated 3 years ago
Huage001 / Paint-Anything
An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.
☆35Updated 2 years ago
hssip / FashionSAP
CVPR2023 paper
☆51Updated last year
GewelsJI / MVLT
Masked Vision-Language Transformer in Fashion
☆33Updated last year
zhangjiewu / awesome-t2i-eval
A curated list of papers and resources for text-to-image evaluation.
☆29Updated last year
ankanbhunia / doodleformer
DoodleFormer: Creative Sketch Drawing with Transformers (ECCV22)
☆29Updated 2 years ago
puar-playground / TextLap
Source code of the TextLap model, a LLM for text-2-layout generation.
☆15Updated 8 months ago
pfnet-research / multi-stage-blended-diffusion
☆29Updated 2 years ago
HumaticsLAB / SEAM-Match-RCNN
Official code of paper: MovingFashion: a Benchmark for the Video-to-Shop Challenge
☆46Updated last year
jialuli-luka / SELMA
Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
☆34Updated last year
showlab / Show-Anything-3D
Edit and Generate Anything in 3D world!
☆13Updated 2 years ago
bahjat-kawar / time-diffusion
Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"
☆86Updated 2 years ago
ml-jku / semantic-image-text-alignment
☆24Updated 2 years ago
crux82 / msr-vtt-it
A large scale dataset for Video Captioning in Italian
☆12Updated 2 years ago
EdiBERT4ImageManipulation / EdiBERT
☆17Updated 2 years ago
neuralchen / Bivolution
Accepted by AAAI2022
☆21Updated 3 years ago
zipengxuc / PPE
Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…
☆37Updated 3 years ago
HITsz-TMG / Agentic-CIGEval
Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation".
☆25Updated 3 months ago
01BB01 / eBayChallenge
[FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.
☆26Updated 2 years ago
eric-ai-lab / Discffusion
Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
☆29Updated last year
facebookresearch / vsc2022
Code for the Video Similarity Challenge.
☆81Updated last year
CyberAgentAILab / webcolor
Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.
☆22Updated last year
nousr / dream-bench
A tool for benchmarking image generation models.
☆33Updated 2 years ago
callsys / TextVR
[PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension
☆26Updated last year
yhlleo / stylegan-mmuit
[TMM 2022] ISF-GAN.
☆18Updated 6 months ago
WangWenhao0716 / V2L
[CVPR 2022 Challenge Rank 1st] The official code for V2L: Leveraging Vision and Vision-language Models into Large-scale Product Retrieval…
☆29Updated 2 years ago
OpenGVLab / Awesome-DragGAN
Awesome-DragGAN: A curated list of papers, tutorials, repositories related to DragGAN
☆85Updated last year
KMnP / intentonomy
💭 Intentonomy: towards Human Intent Understanding [CVPR 2021]
☆37Updated 3 years ago
ariG23498 / TokenLearner
TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"
☆35Updated 3 years ago
kyegomez / Gen1
My Implementation of " Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML
☆26Updated last year