Recognize graphical user interface layout by grouping GUI elements according to their visual attributes
☆50 · Jun 17, 2022 · Updated 4 years ago
Alternatives and similar repositories for GUI-Perceptual-Grouping
Users that are interested in GUI-Perceptual-Grouping are comparing it to the libraries listed below.
- VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automat… ☆107 · Jul 18, 2025 · Updated 11 months ago
- An accurate GUI element detection approach based on old-fashioned CV algorithms [Upgraded on 5/July/2021] ☆545 · Nov 8, 2023 · Updated 2 years ago
- The dataset includes UI object type labels (e.g., BUTTON, IMAGE, CHECKBOX) that describe the semantic type of a UI object on Android ap… ☆54 · Jan 14, 2022 · Updated 4 years ago
- Object Detection for Graphical User Interface: Old Fashioned or Deep Learning or a Combination? ☆128 · Feb 20, 2024 · Updated 2 years ago
- ☆34 · Oct 1, 2024 · Updated last year
- ☆36 · Nov 22, 2022 · Updated 3 years ago
- A mobile GUI search engine using a vision-language model ☆14 · May 5, 2025 · Updated last year
- UICrit is a dataset containing human-generated natural language design critiques, corresponding bounding boxes for each critique, and des… ☆25 · Nov 19, 2024 · Updated last year
- ☆17 · May 14, 2024 · Updated 2 years ago
- This repository contains the open-source version of the datasets that were used for different parts of training and testing of models that grou… ☆34 · Aug 20, 2020 · Updated 5 years ago
- The ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K … ☆150 · Feb 7, 2025 · Updated last year
- The dataset includes widget captions that describe UI elements' functionalities. It is used for training and evaluation of the widget ca… ☆23 · Jun 24, 2021 · Updated 4 years ago
- Automating Android apps with ChatGPT-like LLM. ☆157 · Jan 17, 2024 · Updated 2 years ago
- Memory Cleaner, Phone Booster and Optimizer. ☆10 · Nov 20, 2018 · Updated 7 years ago
- Explore Android apps like a human. ☆133 · Feb 18, 2023 · Updated 3 years ago
- This is a project that aims to use Claude.ai's coding capabilities, artifact capabilities, and project capabilities to create a new metho… ☆12 · Jan 31, 2025 · Updated last year
- Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments ☆61 · Aug 19, 2024 · Updated last year
- UI auto test framework based on YOLO to recognize elements, less code, less maintenance, cross platform, cross project / YOLO-based automated UI-layer testing… ☆15 · Feb 27, 2026 · Updated 3 months ago
- ☆30 · Apr 16, 2024 · Updated 2 years ago
- "Editing Motion Graphics Video via Motion Vectorization and Transformation." SIGGRAPH Asia 2023. ☆13 · Jan 24, 2024 · Updated 2 years ago
- A simple example of running the mnn-llm language model locally on Android ☆13 · Oct 2, 2025 · Updated 8 months ago
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents ☆17 · Oct 12, 2024 · Updated last year
- Chinese edition of Andrew Ng's large language model course series, including "Prompt Engineering", "Building System", and "LangChain" ☆12 · Jun 7, 2023 · Updated 3 years ago
- ☆33 · Sep 27, 2024 · Updated last year
- Seq2act: Mapping Natural Language Instructions to Mobile UI Action Sequences from Google research ☆15 · Jul 13, 2020 · Updated 5 years ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing" ☆24 · Mar 18, 2025 · Updated last year
- Scene-OCR: CRAFT: text detection + TPS-ResNet-BiLSTM-Attn: text recognition ☆10 · Nov 22, 2022 · Updated 3 years ago
- ☆14 · Dec 25, 2023 · Updated 2 years ago
- 🎧 Real-time data streaming from NeuroSky MindWave Mobile Headset ☆10 · Jul 17, 2020 · Updated 5 years ago
- ☆12 · Oct 23, 2019 · Updated 6 years ago
- Mobile App Analysis and Testing Literature ☆109 · Apr 7, 2026 · Updated 2 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin… ☆14 · Jul 27, 2025 · Updated 10 months ago
- Turn any paper into a browser-readable annotated webpage, zero API keys required. ☆108 · Mar 15, 2026 · Updated 3 months ago
- ☆17 · Oct 30, 2023 · Updated 2 years ago
- Emotiv SDK Community Edition ☆13 · Oct 9, 2015 · Updated 10 years ago
- ☆132 · Dec 4, 2023 · Updated 2 years ago
- CVPR25 ☆28 · Jul 2, 2025 · Updated 11 months ago
- ☆46 · Mar 19, 2024 · Updated 2 years ago
- Interactive Installation using Neurosky EEG, Touch Designer, and a Stereoscopic CAVE ☆11 · Jun 12, 2014 · Updated 12 years ago