JosephPai / Art-DescriptionLinks
Code for paper <Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation> in ICCV 2021.
☆13Updated 3 years ago
Alternatives and similar repositories for Art-Description
Users that are interested in Art-Description are comparing it to the libraries listed below
Sorting:
- Repository for the data in the paper "Explain Me the Painting: Multi-TopicKnowledgeable Art Description Generation".☆20Updated 3 years ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆35Updated 2 years ago
- CVPR2023 paper☆51Updated last year
- Masked Vision-Language Transformer in Fashion☆33Updated last year
- A curated list of papers and resources for text-to-image evaluation.☆29Updated last year
- DoodleFormer: Creative Sketch Drawing with Transformers (ECCV22)☆29Updated 2 years ago
- Source code of the TextLap model, a LLM for text-2-layout generation.☆15Updated 8 months ago
- ☆29Updated 2 years ago
- Official code of paper: MovingFashion: a Benchmark for the Video-to-Shop Challenge☆46Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆34Updated last year
- Edit and Generate Anything in 3D world!☆13Updated 2 years ago
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"☆86Updated 2 years ago
- ☆24Updated 2 years ago
- A large scale dataset for Video Captioning in Italian☆12Updated 2 years ago
- ☆17Updated 2 years ago
- Accepted by AAAI2022☆21Updated 3 years ago
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Updated 3 years ago
- Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation".☆25Updated 3 months ago
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆29Updated last year
- Code for the Video Similarity Challenge.☆81Updated last year
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆22Updated last year
- A tool for benchmarking image generation models.☆33Updated 2 years ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆26Updated last year
- [TMM 2022] ISF-GAN.☆18Updated 6 months ago
- [CVPR 2022 Challenge Rank 1st] The official code for V2L: Leveraging Vision and Vision-language Models into Large-scale Product Retrieval…☆29Updated 2 years ago
- Awesome-DragGAN: A curated list of papers, tutorials, repositories related to DragGAN☆85Updated last year
- 💭 Intentonomy: towards Human Intent Understanding [CVPR 2021]☆37Updated 3 years ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆35Updated 3 years ago
- My Implementation of " Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML☆26Updated last year