Recognize graphic user interface layout through grouping GUI elements according to their visual attributes
☆49Jun 17, 2022Updated 3 years ago
Alternatives and similar repositories for GUI-Perceptual-Grouping
Users that are interested in GUI-Perceptual-Grouping are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automat…☆105Jul 18, 2025Updated 8 months ago
- An accurate GUI element detection approach based on old-fashioned CV algorithms [Upgraded on 5/July/2021]☆533Nov 8, 2023Updated 2 years ago
- The dataset includes UI object type labels (e.g., BUTTON, IMAGE, CHECKBOX) that describes the semantic type of an UI object on Android ap…☆54Jan 14, 2022Updated 4 years ago
- Object Detection for Graphical User Interface: Old Fashioned or Deep Learning or a Combination?☆129Feb 20, 2024Updated 2 years ago
- ☆36Nov 22, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…☆150Jan 3, 2026Updated 2 months ago
- Conv Net for identifying GUI componenets from screenshots using Tensorflow☆12Mar 24, 2023Updated 3 years ago
- A mobile GUI search engine using a vision-language model☆14May 5, 2025Updated 10 months ago
- UICrit is a dataset containing human-generated natural language design critiques, corresponding bounding boxes for each critique, and des…☆26Nov 19, 2024Updated last year
- ☆44Dec 8, 2025Updated 3 months ago
- ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K …☆142Feb 7, 2025Updated last year
- It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item …☆47Aug 2, 2021Updated 4 years ago
- The dataset includes widget captions that describes UI element's functionalities. It is used for training and evaluation of the widget ca…☆23Jun 24, 2021Updated 4 years ago
- Dataset builder toolkit for Pix2code Screenshot-to-code☆35Sep 19, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Automating Android apps with ChatGPT-like LLM.☆154Jan 17, 2024Updated 2 years ago
- Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments☆61Aug 19, 2024Updated last year
- ☆30Apr 16, 2024Updated last year
- Convert style sheets to json☆25Jan 20, 2020Updated 6 years ago
- This is the official repository for "Can GPTs Evaluate Graphic Design Based on Design Principles?".☆13Feb 10, 2025Updated last year
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents☆17Oct 12, 2024Updated last year
- ☆31Sep 27, 2024Updated last year
- Seq2act: Mapping Natural Language Instructions to Mobile UI Action Sequences from Google research☆15Jul 13, 2020Updated 5 years ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆23Mar 18, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Scene-OCR: CRAFT: text detection + TPS-ResNet-BiLSTM-Attn: text recognition☆10Nov 22, 2022Updated 3 years ago
- 🎧 Real-time data streaming from NeuroSky MindWave Mobile Headset☆10Jul 17, 2020Updated 5 years ago
- The model, data and code for the visual GUI Agent SeeClick☆475Jul 13, 2025Updated 8 months ago
- 归纳了用mobilenet加arcfaceloss训练模型的keras框架,并提供将模型转为八位tflite的脚本☆12Nov 21, 2022Updated 3 years ago
- GUI Grounding for Professional High-Resolution Computer Use☆354Mar 4, 2026Updated 3 weeks ago
- ☆131Dec 4, 2023Updated 2 years ago
- RL-AFEC: Adaptive Forward Error Correction for Real-time Video Communication Based on Reinforcement Learning☆21Mar 31, 2022Updated 3 years ago
- CVPR25☆27Jul 2, 2025Updated 8 months ago
- ☆11Jul 18, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Interactive Installation using Neurosky EEG, Touch Designer, and a Stereoscopic CAVE☆11Jun 12, 2014Updated 11 years ago
- Custom object detection for UI of the design system using TensorFlow☆17Jun 20, 2023Updated 2 years ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆46Dec 17, 2025Updated 3 months ago
- A corpus generation tool☆27Jan 5, 2026Updated 2 months ago
- Interface with Emotiv device and keystroke events☆13Apr 29, 2023Updated 2 years ago
- Obsolete repo, merged into eynollah☆12Sep 29, 2025Updated 6 months ago
- Some simple codes to format the CSDMC2010 SPAM corpus☆11Sep 18, 2016Updated 9 years ago