google-research-datasets / uibertLinks

It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item Selection (VIS) data. Both datasets are written TFRecords.

☆44

Alternatives and similar repositories for uibert

Users that are interested in uibert are comparing it to the libraries listed below

Sorting:

chenjshnn / Object-Detection-for-Graphical-User-Interface
Object Detection for Graphical User Interface: Old Fashioned or Deep Learning or a Combination?
☆127Updated last year
google-research-datasets / clay
The dataset includes UI object type labels (e.g., BUTTON, IMAGE, CHECKBOX) that describes the semantic type of an UI object on Android ap…
☆52Updated 3 years ago
google-research-datasets / screen2words
The dataset includes screen summaries that describes Android app screenshot's functionalities. It is used for training and evaluation of …
☆58Updated 4 years ago
google-research-datasets / seq2act
This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…
☆32Updated 4 years ago
luileito / enrico
A curated mobile app design database
☆61Updated 3 years ago
tobyli / Screen2Vec
Screen2Vec is a new self-supervised technique for generating more comprehensive semantic embeddings of GUI screens and components using t…
☆75Updated 6 months ago
sbunian / VINS
VINS: Visual Search for Mobile User Interface Design
☆42Updated 4 years ago
aburns4 / MoTIF
Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments
☆61Updated 11 months ago
kevalmorabia97 / CoVA-Web-Object-Detection
A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!
☆93Updated 5 months ago
NiteshMethani / PlotQA
Dataset introduced in PlotQA: Reasoning over Scientific Plots
☆79Updated 2 years ago
deepneuralmachine / seq2act-tensorflow
Seq2act: Mapping Natural Language Instructions to Mobile UI Action Sequences from Google research
☆14Updated 5 years ago
google-research-datasets / widget-caption
The dataset includes widget captions that describes UI element's functionalities. It is used for training and evaluation of the widget ca…
☆22Updated 4 years ago
google-research-datasets / rico_semantics
Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations b…
☆28Updated last year
MulongXie / GUI-Perceptual-Grouping
Recognize graphic user interface layout through grouping GUI elements according to their visual attributes
☆45Updated 3 years ago
datadrivendesign / semantic-icon-classifier
☆35Updated 2 years ago
js0nwu / webui
☆120Updated last year
due-benchmark / baselines
The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."
☆36Updated 2 years ago
dips4717 / gcn-cnn
Learning UI Similarity using Graph Networks
☆38Updated 4 years ago
applicaai / CCpdf
Index of URLs to pdf files all over the internet and scripts
☆24Updated 2 years ago
google-deepmind / pix2act
☆59Updated last year
kdavila / ChartInfo_annotation_tools
Release for CHART annotation tools used for ICDAR CHART 2019 competition
☆28Updated last year
shuyanzhou / docprompting
Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023
☆248Updated last year
JasonObeid / Chart2Text
Chart-to-Text: Generating Natural Language Explanations for Charts by Adapting the Transformer Model
☆156Updated 2 years ago
eric-ai-lab / Screen-Point-and-Read
Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"
☆28Updated last year
Anni-Zou / DocBench
DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems
☆42Updated 10 months ago
reasoning-machines / prompt-lib
A set of utilities for running few-shot prompting experiments on large-language models
☆122Updated last year
X-LANCE / TIE
[NAACL 2022] TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages
☆20Updated 3 years ago
nttmdlab-nlp / SlideVQA
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
☆92Updated 4 months ago
reasoning-machines / CoCoGen
Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)
☆86Updated 2 years ago
vis-nlp / Chart-to-text
☆115Updated last year