The dataset includes screen summaries that describes Android app screenshot's functionalities. It is used for training and evaluation of the screen2words models (our paper accepted by UIST'21 will be linked soon).
☆67Jul 27, 2021Updated 4 years ago
Alternatives and similar repositories for screen2words
Users that are interested in screen2words are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item …☆47Aug 2, 2021Updated 4 years ago
- The Screen Annotation dataset consists of pairs of mobile screenshots and their annotations. The annotations are in text format, and desc…☆88Mar 7, 2024Updated 2 years ago
- The dataset includes UI object type labels (e.g., BUTTON, IMAGE, CHECKBOX) that describes the semantic type of an UI object on Android ap…☆54Jan 14, 2022Updated 4 years ago
- ☆31Sep 27, 2024Updated last year
- A mobile GUI search engine using a vision-language model☆14May 5, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments☆61Aug 19, 2024Updated last year
- This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…☆34Aug 20, 2020Updated 5 years ago
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆29Jul 31, 2024Updated last year
- ☆23Oct 11, 2024Updated last year
- An accurate GUI element detection approach based on old-fashioned CV algorithms [Upgraded on 5/July/2021]☆538Nov 8, 2023Updated 2 years ago
- ☆35May 29, 2025Updated 10 months ago
- ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K …☆144Feb 7, 2025Updated last year
- We release the UICaption dataset. The dataset consists of UI images (icons and screenshots) and associated text descriptions. This datase…☆42Nov 29, 2022Updated 3 years ago
- This repository holds the data and code for the AndroR2 dataset of manually-reproduced bug reports for Android apps☆25Jun 11, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Object Detection for Graphical User Interface: Old Fashioned or Deep Learning or a Combination?☆129Feb 20, 2024Updated 2 years ago
- Crawl github data using API and no-API☆12Jul 4, 2017Updated 8 years ago
- Overview of Clone Detection Tools for Java☆14Aug 23, 2025Updated 7 months ago
- A GAN-based GUI generation method☆79May 22, 2021Updated 4 years ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆65Oct 19, 2024Updated last year
- ☆30Apr 16, 2024Updated last year
- (ICLR 2025) The Official Code Repository for GUI-World.☆69Dec 18, 2024Updated last year
- Web app for makeup transfer using Stable Diffusion☆10Sep 11, 2023Updated 2 years ago
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents☆19Feb 26, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆10Dec 21, 2020Updated 5 years ago
- A customized version of Ella used in the paper `An Empirical Study of Android Test Generation Tools in Industrial Cases`.☆10Aug 19, 2020Updated 5 years ago
- ☆26Nov 19, 2025Updated 4 months ago
- ☆10May 30, 2018Updated 7 years ago
- Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"☆382Mar 27, 2026Updated 2 weeks ago
- ☆15Jan 19, 2020Updated 6 years ago
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models☆37Sep 19, 2023Updated 2 years ago
- Under construction☆13Jan 15, 2025Updated last year
- ☆10Aug 28, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 8 months ago
- The model, data and code for the visual GUI Agent SeeClick☆477Jul 13, 2025Updated 9 months ago
- Quality Metrics for evaluating the inter-cluster reliability of Multidimensional Projections☆26Apr 30, 2023Updated 2 years ago
- https://towardsdatascience.com/instance-segmentation-web-app-63016b8ed4ae☆12Mar 3, 2021Updated 5 years ago
- Recognize graphic user interface layout through grouping GUI elements according to their visual attributes☆49Jun 17, 2022Updated 3 years ago
- Urban Generative Intelligence (UGI): A Foundational Platform for Embodied Agent and Future City☆12Dec 17, 2023Updated 2 years ago
- ☆16Apr 9, 2021Updated 5 years ago