google-research-datasets / uicrit
UICrit is a dataset containing human-generated natural language design critiques, corresponding bounding boxes for each critique, and design quality ratings for 1,000 mobile UIs from RICO. This dataset was collected for our UIST '24 paper: https://arxiv.org/abs/2407.08850.
☆20Updated 5 months ago
Alternatives and similar repositories for uicrit
Users that are interested in uicrit are comparing it to the libraries listed below
Sorting:
- Code for the paper "AutoPresent: Designing Structured Visuals From Scratch" (CVPR 2025)☆74Updated 2 months ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆44Updated last year
- ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation☆36Updated last month
- ☆29Updated 7 months ago
- [NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos☆35Updated last month
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆43Updated 2 months ago
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆28Updated 9 months ago
- Repo for paper: https://arxiv.org/abs/2404.06479☆27Updated 7 months ago
- Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations b…☆27Updated 10 months ago
- The dataset includes UI object type labels (e.g., BUTTON, IMAGE, CHECKBOX) that describes the semantic type of an UI object on Android ap…☆52Updated 3 years ago
- Multimodal RewardBench☆39Updated 2 months ago
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation☆48Updated 9 months ago
- ☆13Updated 7 months ago
- Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".☆57Updated last month
- ☆40Updated 10 months ago
- ☆44Updated last month
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆30Updated last year
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Updated last year
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆35Updated 8 months ago
- ☆29Updated 10 months ago
- Official Repository of Personalized Visual Instruct Tuning☆28Updated 2 months ago
- ☆20Updated 9 months ago
- Code released for our CHI2023 paper "UEyes: Understanding Visual Saliency across User Interface Types"☆28Updated 10 months ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆18Updated 10 months ago
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆39Updated 2 months ago
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆71Updated 8 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆66Updated 8 months ago
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆35Updated 11 months ago
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆67Updated 2 months ago
- A benchmark dataset for evaluating LLM's SVG editing capabilities☆31Updated 7 months ago