We release the UICaption dataset. The dataset consists of UI images (icons and screenshots) and associated text descriptions. This dataset was used to pre-train the Lexi model which provides a generic representation of UI screens and their components.
☆42Nov 29, 2022Updated 3 years ago
Alternatives and similar repositories for UICaption
Users that are interested in UICaption are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments☆61Aug 19, 2024Updated last year
- Code that accompanies the PyData New York (2022) talk: Addressing the sensitivity of Large language models☆13Nov 7, 2022Updated 3 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆22Mar 19, 2022Updated 4 years ago
- Quality Metrics for evaluating the inter-cluster reliability of Multidimensional Projections☆27Apr 30, 2023Updated 3 years ago
- ☆16Oct 1, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for the paper "LASER: LLM Agent with State-Space Exploration for Web Navigation"☆35Sep 26, 2023Updated 2 years ago
- The dataset includes screen summaries that describes Android app screenshot's functionalities. It is used for training and evaluation of …☆67Jul 27, 2021Updated 4 years ago
- The dataset includes UI object type labels (e.g., BUTTON, IMAGE, CHECKBOX) that describes the semantic type of an UI object on Android ap…☆54Jan 14, 2022Updated 4 years ago
- SeeSo(Eye-Tracking SDK) sample for iOS☆13Jan 5, 2024Updated 2 years ago
- Web app for makeup transfer using Stable Diffusion☆10Sep 11, 2023Updated 2 years ago
- ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K …☆150Feb 7, 2025Updated last year
- Acoustic Content Single Page Application website implemented in React☆13Sep 20, 2021Updated 4 years ago
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering☆18Jan 7, 2023Updated 3 years ago
- 📖 UI/UX context detection engine☆12Jan 3, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- My personal implementation of SVTR model for handwritten OCR☆14Mar 1, 2024Updated 2 years ago
- SeeSo(Eye-Tracking SDK) Sample vanillaJS script.☆14Oct 17, 2023Updated 2 years ago
- VNOnDB dataset extractor. This dataset can be use for build deep learning model to attack vietnamese handwritten text recognition problem…☆19Sep 8, 2021Updated 4 years ago
- Workflow-Guided Exploration: sample-efficient RL agent for web tasks☆118Jun 5, 2023Updated 3 years ago
- DeepStyle provides pretrained models aiming to project text in a stylometric space. The base project consists in a new method of represen…☆15Jun 9, 2023Updated 3 years ago
- This repository shows how to train a CNN model for detecting vehicles and other objects on streets☆16Nov 21, 2019Updated 6 years ago
- In-IDE Code Search☆29Apr 29, 2022Updated 4 years ago
- package for crowd counting☆11Jun 23, 2020Updated 6 years ago
- 한국어 다중분류 감성분석☆20Jun 7, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A Fabric Modpack aims for both Performance and Quality Of Life☆14Jun 5, 2023Updated 3 years ago
- Screen2Vec is a new self-supervised technique for generating more comprehensive semantic embeddings of GUI screens and components using t…☆84Feb 3, 2025Updated last year
- Version 3.0.0 Pytorch implementations of DQN, DDQN, DDPG, SAC, Discrete SAC. With more features :)☆12Feb 16, 2023Updated 3 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- B2 Quick Start Sample App using Python (boto3) and B2 S3 Compatible API. First in series. This sample app integrates with a pre-staged B…☆20Jan 20, 2026Updated 5 months ago
- ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation. AAAI, 2025☆16Aug 25, 2025Updated 10 months ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆27May 25, 2023Updated 3 years ago
- ☆20Jun 19, 2020Updated 6 years ago
- Cross-Domain Imitation Learning via Optimal Transport☆27Jun 24, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official Implementation of LatentSwap3D: Semantic Edits on 3D Image GANs☆23Nov 28, 2023Updated 2 years ago
- PyTorch implementation of Data2Vec self-supervised approach for vision use cases.☆18Oct 7, 2022Updated 3 years ago
- Yolo object detection browser, Power by onnxruntime-web, Support WebGPU, wasm(cpu). Webcam support for live detection, Add your custom mo…☆38Jan 17, 2026Updated 5 months ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆22Dec 7, 2023Updated 2 years ago
- An official code of Densely-packed Object Detection via Hard Negative-Aware Anchor Attention in WACV2022☆12Jan 6, 2022Updated 4 years ago
- Program for detecting objects☆16Apr 17, 2024Updated 2 years ago
- Real-time .NET proxy and dashboard for inspecting AI coding agent API calls (currently supports Claude Code)☆43Jun 21, 2026Updated last week