cvzoya / visuallydata
A large-scale infographics dataset from Visual.ly with metadata and additional crowdsourced annotations
☆12 · Updated 5 years ago
Related projects:
- A large-scale curated dataset of Visual.ly infographics with metadata and additional crowdsourced annotations for research applications i… ☆29 · Updated 5 years ago
- Release of the CHART annotation tools used for the ICDAR CHART 2019 competition ☆25 · Updated last year
- Official PyTorch implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021 ☆69 · Updated 3 years ago
- PyTorch implementation of Value Retrieval with Arbitrary Queries for Form-like Documents ☆16 · Updated last year
- ECCV 2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and data. ☆81 · Updated last year
- PyTorch implementation of pixel-wise scene text segmentation based on DeepLabV3+ ☆11 · Updated 4 years ago
- Implementation of a seq2seq model for the Visual Storytelling Challenge (VIST) http://visionandlanguage.net/VIST/index.html ☆58 · Updated 6 years ago
- DVQA Dataset: A bar chart question answering dataset presented at CVPR 2018 ☆31 · Updated 5 years ago
- Swire dataset and application code ☆17 · Updated 5 years ago
- ☆9 · Updated 2 years ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat… ☆52 · Updated 8 months ago
- Learning UI Similarity using Graph Networks ☆34 · Updated 3 years ago
- Document Visual Question Answering ☆110 · Updated 4 years ago
- ☆77 · Updated last year
- Scene Text Aware Cross Modal Retrieval (StacMR) ☆24 · Updated 3 years ago
- A dataset of crowdsourced ratings for machine-generated image captions ☆31 · Updated 5 years ago
- CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design ☆27 · Updated 2 years ago
- [CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning ☆89 · Updated 5 months ago
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps [AAAI 2021] ☆56 · Updated 2 years ago
- AQUA dataset and VIKING model for the task of Art Visual Question Answering ☆21 · Updated 3 years ago
- It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item … ☆41 · Updated 3 years ago
- VINS: Visual Search for Mobile User Interface Design ☆26 · Updated 3 years ago
- A reproduction of the paper LayoutGAN: Generating Graphic Layouts with Wireframe Discriminators ☆75 · Updated 5 years ago
- Textual Visual Semantic Dataset for Text Spotting. CVPRW 2020 ☆9 · Updated 2 years ago
- Towards Flexible Multi-modal Document Models [Inoue+, CVPR 2023] ☆55 · Updated last year
- Baselines for the DocVQA dataset ☆21 · Updated 3 years ago
- Image Captioning through Image Transformer ☆40 · Updated 3 years ago
- A unified framework to jointly model images, text, and human attention traces ☆78 · Updated 3 years ago
- Code for the ICDAR 2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing" ☆33 · Updated 2 years ago
- Official code for the WACV 2021 paper "Compositional Learning of Image-Text Query for Image Retrieval" ☆55 · Updated 2 years ago