gridaco / ui-dataset
A pre-labelled dataset for UI element and layout detection
☆66Updated 2 years ago
Alternatives and similar repositories for ui-dataset
Users who are interested in ui-dataset are comparing it to the repositories listed below
- The dataset includes UI object type labels (e.g., BUTTON, IMAGE, CHECKBOX) that describe the semantic type of a UI object on Android ap…☆52Updated 3 years ago
- Detectron2 Webserver (Faster-RCNN) implementation for Ubuntu 20.04. Real time object detection served over the internet.☆32Updated 2 years ago
- ☆122Updated last year
- It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item …☆45Updated 4 years ago
- Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"☆357Updated last week
- An AI agent for interacting with a computer using the graphical user interface☆76Updated last year
- GUI Grounding for Professional High-Resolution Computer Use☆248Updated 3 weeks ago
- GPT-4V in Wonderland: LMMs as Smartphone Agents☆134Updated last year
- The dataset includes screen summaries that describe the functionalities of Android app screenshots. It is used for training and evaluation of …☆58Updated 4 years ago
- Tiny, structured coding tutorials that can be searched semantically☆163Updated last year
- Recognize graphic user interface layout through grouping GUI elements according to their visual attributes☆45Updated 3 years ago
- Conversational UI for Figma☆88Updated 2 years ago
- Figma Files Scraper for Research & Studies☆24Updated 2 months ago
- The Screen Annotation dataset consists of pairs of mobile screenshots and their annotations. The annotations are in text format, and desc…☆73Updated last year
- Multimodal computer agent data collection program☆145Updated last year
- Framework to evaluate LLM generated ReactJS code.☆58Updated last year
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆157Updated 6 months ago
- ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K …☆126Updated 6 months ago
- Custom object detection for UI of the design system using TensorFlow☆16Updated 2 years ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆247Updated last year
- Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations b…☆30Updated last year
- TaxyAI: Open-source browser automation with GPT-4 (backup)☆55Updated 2 years ago
- Generate an End-to-End test with a single sentence☆37Updated last year
- A codebase for "Language Models can Solve Computer Tasks"☆234Updated last year
- Learn how to use logit bias with OpenAI models to create highly powerful classifiers in minutes.☆34Updated 2 years ago
- Test suite for LLM prompts☆52Updated last year
- The model, data and code for the visual GUI Agent SeeClick☆417Updated last month
- Complex question answering in LLMs with enhanced reasoning and information-seeking capabilities.☆201Updated last year
- Mobile adapter for iOS and Android for mobile LLM agents☆37Updated 9 months ago
- Self-hosted version of Microsoft's OmniParser Image-to-text model☆73Updated 3 months ago