cvzoya / visuallydataLinks
A large-scale infographics dataset from Visual.ly with metadata and additional crowdsourced annotations
☆15Updated 7 years ago
Alternatives and similar repositories for visuallydata
Users that are interested in visuallydata are comparing it to the libraries listed below
Sorting:
- Release for CHART annotation tools used for ICDAR CHART 2019 competition☆28Updated 2 years ago
- Document Visual Question Answering☆128Updated 5 years ago
- Graphical Object Detection in Document Images☆26Updated 5 years ago
- A large-scale curated dataset of Visual.ly infographics with metadata and additional crowdsourced annotations for research applications i…☆33Updated 6 years ago
- Implementation of seq2seq model for Visual Storytelling Challenge (VIST) http://visionandlanguage.net/VIST/index.html☆62Updated 7 years ago
- Detectron2 for Document Layout Analysis☆187Updated last year
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆90Updated 4 years ago
- Image Captioning Using Transformer☆271Updated 3 years ago
- BERT + Image Captioning☆135Updated 4 years ago
- Good News Everyone! - CVPR 2019☆128Updated 3 years ago
- DVQA Dataset: A Bar chart question answering dataset presented at CVPR 2018☆37Updated 6 years ago
- [CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning☆93Updated last year
- baselines for DocVQA dataset☆21Updated 4 years ago
- Quicksign OCRized Text Dataset (QS-OCR)☆45Updated 6 years ago
- Research papers and code on information extraction from image/pdf☆97Updated 3 years ago
- Code for my ICDAR paper "Deep Visual Template-Free Form Parsing"☆89Updated 3 years ago
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)☆246Updated 6 months ago
- Vision-Language Pre-training for Image Captioning and Question Answering☆424Updated 3 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Updated 3 years ago
- Implementation of BertGrid : https://arxiv.org/abs/1909.04948☆30Updated last year
- code and data for paper "One-shot Text Field Labeling using Attention and BeliefPropagation for Structure Information Extraction"☆61Updated 5 years ago
- Swire Dataset and Application Code☆17Updated 6 years ago
- Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.☆16Updated 7 months ago
- deep learning, image retrieval, vision and language☆304Updated 4 years ago
- ☆82Updated 3 years ago
- Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.☆223Updated 4 years ago
- Learning UI Similarity using Graph Networks☆39Updated 4 years ago
- Extraction of meaningful instances from document images with a Chargrid model☆34Updated 4 years ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Updated 2 years ago
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]☆57Updated 3 years ago