diviz-mit / visuallydataLinks
A large-scale curated dataset of Visual.ly infographics with metadata and additional crowdsourced annotations for research applications in computer vision and natural language processing.
☆31Updated 6 years ago
Alternatives and similar repositories for visuallydata
Users that are interested in visuallydata are comparing it to the libraries listed below
Sorting:
- Release for CHART annotation tools used for ICDAR CHART 2019 competition☆28Updated 2 years ago
- DVQA Dataset: A Bar chart question answering dataset presented at CVPR 2018☆37Updated 6 years ago
- A large-scale infographics dataset from Visual.ly with metadata and additional crowdsourced annotations☆15Updated 7 years ago
- ☆27Updated 6 years ago
- ☆145Updated 2 years ago
- AQUA dataset and VIKING model for the task of Art Visual Question Answering☆27Updated 4 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆90Updated 4 years ago
- CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design☆28Updated 3 years ago
- Learning UI Similarity using Graph Networks☆39Updated 4 years ago
- A collection of models for image<->text generation in ACM MM 2021.☆67Updated 4 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 4 years ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆42Updated 2 years ago
- ☆44Updated 4 years ago
- ☆29Updated 5 years ago
- Document Visual Question Answering☆128Updated 5 years ago
- Tornado is an open source Human-in-the-loop machine learning tool. It helps you label your dataset on the fly while training your model t…☆67Updated 2 years ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆103Updated 8 months ago
- The dataset includes UI object type labels (e.g., BUTTON, IMAGE, CHECKBOX) that describes the semantic type of an UI object on Android ap…☆53Updated 3 years ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆82Updated 2 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆82Updated 2 years ago
- Synthetic Dataset used in the ICDAR2019 Competition on HArvesting Raw Tables from Infographics (CHART-Infographics)☆23Updated 6 years ago
- ☆22Updated 4 years ago
- Pytorch implementation of LayoutGMN.☆47Updated 3 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 7 months ago
- Official Github Repo for the Findings of EMNLP 2021 paper "An animated picture says at least a thousand words: Selecting Gif-based Replie…☆32Updated 4 years ago
- ☆22Updated 6 years ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆80Updated 2 years ago
- ☆48Updated 4 years ago
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Updated 3 years ago
- A image caption dataset about images from www.dpchallenge.com.☆19Updated 5 years ago