google-research-datasets/widget-caption

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-research-datasets/widget-caption)

google-research-datasets / widget-caption

The dataset includes widget captions that describes UI element's functionalities. It is used for training and evaluation of the widget captioning model (please see the EMNLP'20 paper: https://arxiv.org/abs/2010.04295).

☆23

Alternatives and similar repositories for widget-caption

Users that are interested in widget-caption are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

google-research-datasets / screen2words
View on GitHub
The dataset includes screen summaries that describes Android app screenshot's functionalities. It is used for training and evaluation of …
☆67Jul 27, 2021Updated 4 years ago
Review4Repair / Review4Repair
View on GitHub
☆10Feb 8, 2021Updated 5 years ago
google-research-datasets / uibert
View on GitHub
It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item …
☆48Aug 2, 2021Updated 4 years ago
deepneuralmachine / seq2act-tensorflow
View on GitHub
Seq2act: Mapping Natural Language Instructions to Mobile UI Action Sequences from Google research
☆15Jul 13, 2020Updated 6 years ago
UCSB-AI / Screen-Point-and-Read
View on GitHub
Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"
☆31May 12, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
js0nwu / webui
View on GitHub
☆132Dec 4, 2023Updated 2 years ago
testtestA6 / VisionDroid
View on GitHub
VisionDroid
☆22Apr 2, 2024Updated 2 years ago
SageSELab / AndroR2
View on GitHub
This repository holds the data and code for the AndroR2 dataset of manually-reproduced bug reports for Android apps
☆26Jun 11, 2021Updated 5 years ago
asmahdi / lunarvr
View on GitHub
LunarVR is a virtual reality application made for NASA SPACE APPPS CHALLENGE 2018. This project was awarded as Global Winner in Best use …
☆12Feb 7, 2023Updated 3 years ago
datadrivendesign / semantic-icon-classifier
View on GitHub
☆36Nov 22, 2022Updated 3 years ago
google-research-datasets / clay
View on GitHub
The dataset includes UI object type labels (e.g., BUTTON, IMAGE, CHECKBOX) that describes the semantic type of an UI object on Android ap…
☆54Jan 14, 2022Updated 4 years ago
chenjshnn / LabelDroid
View on GitHub
Unblind Your Apps: Predicting Natural-Language Labels for Mobile GUI Components by Deep Learning
☆48Nov 16, 2023Updated 2 years ago
google-research-datasets / screen_qa
View on GitHub
ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K …
☆151Feb 7, 2025Updated last year
gridaco / context
View on GitHub
📖 UI/UX context detection engine
☆12Jan 3, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
google-research-datasets / screen_annotation
View on GitHub
The Screen Annotation dataset consists of pairs of mobile screenshots and their annotations. The annotations are in text format, and desc…
☆93Mar 7, 2024Updated 2 years ago
UGAIForge / DesignRepair
View on GitHub
☆13Feb 24, 2025Updated last year
google-research-datasets / seq2act
View on GitHub
This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…
☆35Aug 20, 2020Updated 5 years ago
njucckevin / SeeClick
View on GitHub
The model, data and code for the visual GUI Agent SeeClick
☆493Jul 13, 2025Updated last year
imJouch / helmet-detection
View on GitHub
非机动车头盔佩戴检测
☆12Jan 20, 2026Updated 6 months ago
Jl-wei / guing
View on GitHub
A mobile GUI search engine using a vision-language model
☆15May 5, 2025Updated last year
DelTA-Lab-IITK / shad3s
View on GitHub
☆14Mar 31, 2022Updated 4 years ago
mikezucc / augmented-reality-fighter-pygame
View on GitHub
Using OpenCV to create an AR extension for PyGame
☆11May 17, 2019Updated 7 years ago
YuxiangChai / AMEX-codebase
View on GitHub
☆33Sep 27, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aburns4 / MoTIF
View on GitHub
Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments
☆61Aug 19, 2024Updated last year
amrendra18 / codezera
View on GitHub
Capstone Project : part of Android Nanodegree program at Udacity
☆12Dec 22, 2016Updated 9 years ago
spyysalo / s800
View on GitHub
Tools for working with the S800 corpus
☆12Sep 17, 2020Updated 5 years ago
pixelatedbrian / fortnight-furniture
View on GitHub
☆10May 30, 2018Updated 8 years ago
sbunian / VINS
View on GitHub
VINS: Visual Search for Mobile User Interface Design
☆53Jan 9, 2021Updated 5 years ago
MulongXie / GUI-Perceptual-Grouping
View on GitHub
Recognize graphic user interface layout through grouping GUI elements according to their visual attributes
☆50Jun 17, 2022Updated 4 years ago
oaishi / 3DScene_from_text
View on GitHub
☆18Feb 21, 2022Updated 4 years ago
guanjunyou / douyin
View on GitHub
字节跳动青训营项目极简抖音后端
☆16Aug 17, 2023Updated 2 years ago
luileito / enrico
View on GitHub
A curated mobile app design database
☆72Sep 27, 2021Updated 4 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
cooelf / Auto-GUI
View on GitHub
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
☆261Jul 16, 2024Updated 2 years ago
harish-kamath / rqae
View on GitHub
Residual Quantization Autoencoder, used for interpreting LLMs
☆14Jan 1, 2025Updated last year
microsoft / UICaption
View on GitHub
We release the UICaption dataset. The dataset consists of UI images (icons and screenshots) and associated text descriptions. This datase…
☆42Nov 29, 2022Updated 3 years ago
shiqinghuayi19 / LLMforEvent
View on GitHub
This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction"
☆10Feb 16, 2024Updated 2 years ago
facebookarchive / ParseAppLinksAnalytics
View on GitHub
[Archive]
☆18Aug 29, 2017Updated 8 years ago
fpdetective / modCrawler
View on GitHub
Crawler based on a modified browser to detect online tracking.
☆11Jul 19, 2023Updated 3 years ago
ihsavru / LanguageBot
View on GitHub
A language learning bot for messenger.
☆16Jan 20, 2018Updated 8 years ago