microsoft / UICaption
We release the UICaption dataset, which consists of UI images (icons and screenshots) paired with text descriptions. The dataset was used to pre-train the Lexi model, which provides a generic representation of UI screens and their components.
☆35 · Updated last year
Related projects
Alternatives and complementary repositories for UICaption
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222 ☆51 · Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation" ☆21 · Updated last year
- ☆21 · Updated 8 months ago
- DeFacto - Demonstrations and Feedback for improving factual consistency of text summarization ☆27 · Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 · Updated last year
- Repository for Skill Set Optimization ☆12 · Updated 3 months ago
- SILO Language Models code repository ☆80 · Updated 9 months ago
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification ☆38 · Updated last year
- ☆16 · Updated 2 weeks ago
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022). ☆23 · Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 · Updated 2 weeks ago
- ☆25 · Updated 9 months ago
- A Data Source for Reasoning Embodied Agents ☆19 · Updated last year
- ☆14 · Updated last year
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models ☆20 · Updated 4 months ago
- Code and data from the paper 'Human Feedback is not Gold Standard' ☆18 · Updated 4 months ago
- Generating and validating natural-language explanations. ☆42 · Updated last week
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers ☆42 · Updated 2 years ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024) ☆25 · Updated 4 months ago
- The official repo of our research work "Interactive Editing for Text Summarization". ☆22 · Updated last year
- Code for “Pretrained Language Models as Visual Planners for Human Assistance” ☆57 · Updated last year
- ☆26 · Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆44 · Updated last year
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems." ☆46 · Updated 4 months ago
- Fault-aware neural code rankers ☆25 · Updated last year
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d. First, we start from a population model which ha… ☆11 · Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆44 · Updated 10 months ago
- Un-*** 50 billions multimodality dataset ☆24 · Updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts ☆21 · Updated last year
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding" ☆23 · Updated 3 months ago