aburns4/MoTIF

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aburns4/MoTIF)

aburns4 / MoTIF

Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments

☆61

Alternatives and similar repositories for MoTIF

Users that are interested in MoTIF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

google-research-datasets / seq2act
View on GitHub
This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…
☆35Aug 20, 2020Updated 5 years ago
X-LANCE / Mobile-Env
View on GitHub
A Universal Platform for Training and Evaluation of Mobile Interaction
☆63Sep 24, 2025Updated 10 months ago
google-research-datasets / uibert
View on GitHub
It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item …
☆48Aug 2, 2021Updated 4 years ago
datadrivendesign / semantic-icon-classifier
View on GitHub
☆36Nov 22, 2022Updated 3 years ago
google-research-datasets / screen_qa
View on GitHub
ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K …
☆151Feb 7, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
LlamaTouch / LlamaTouch
View on GitHub
LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation
☆70Aug 9, 2024Updated last year
deepneuralmachine / seq2act-tensorflow
View on GitHub
Seq2act: Mapping Natural Language Instructions to Mobile UI Action Sequences from Google research
☆15Jul 13, 2020Updated 6 years ago
LlamaTouch / AgentEnv
View on GitHub
An environment for mobile angets to interact with realistic android device or android emulator
☆13Jul 19, 2024Updated 2 years ago
Zsbyqx20 / AgentHazard
View on GitHub
Mobile GUI Agents under Real-world Threats: Are We There Yet?
☆17May 18, 2026Updated 2 months ago
MobileLLM / AgentProg
View on GitHub
AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management
☆32Apr 10, 2026Updated 3 months ago
microsoft / UICaption
View on GitHub
We release the UICaption dataset. The dataset consists of UI images (icons and screenshots) and associated text descriptions. This datase…
☆42Nov 29, 2022Updated 3 years ago
njucckevin / SeeClick
View on GitHub
The model, data and code for the visual GUI Agent SeeClick
☆493Jul 13, 2025Updated last year
sidongfeng / MUD
View on GitHub
☆17May 14, 2024Updated 2 years ago
X-LANCE / META-GUI-baseline
View on GitHub
[EMNLP 2022] The baseline code for META-GUI dataset
☆16Jul 9, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
google-research-datasets / screen2words
View on GitHub
The dataset includes screen summaries that describes Android app screenshot's functionalities. It is used for training and evaluation of …
☆67Jul 27, 2021Updated 5 years ago
OpenGVLab / GUI-Odyssey
View on GitHub
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159Jan 3, 2026Updated 6 months ago
AndroidArenaAgent / AndroidArena
View on GitHub
☆47Apr 11, 2024Updated 2 years ago
cooelf / Auto-GUI
View on GitHub
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
☆261Jul 16, 2024Updated 2 years ago
MobileAgentBench / mobile-agent-bench
View on GitHub
☆37Sep 30, 2024Updated last year
IMNearth / CoAT
View on GitHub
Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)
☆103Oct 14, 2024Updated last year
google-research-datasets / screen_annotation
View on GitHub
The Screen Annotation dataset consists of pairs of mobile screenshots and their annotations. The annotations are in text format, and desc…
☆93Mar 7, 2024Updated 2 years ago
meera1hahn / Graph_LED
View on GitHub
Localization via embodied dialog on the navigation graph
☆15Apr 18, 2022Updated 4 years ago
js0nwu / webui
View on GitHub
☆132Dec 4, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
tobyli / Screen2Vec
View on GitHub
Screen2Vec is a new self-supervised technique for generating more comprehensive semantic embeddings of GUI screens and components using t…
☆85Feb 3, 2025Updated last year
zzxslp / MM-Navigator
View on GitHub
GPT-4V in Wonderland: LMMs as Smartphone Agents
☆134Jul 17, 2024Updated 2 years ago
google-research-datasets / widget-caption
View on GitHub
The dataset includes widget captions that describes UI element's functionalities. It is used for training and evaluation of the widget ca…
☆23Jun 24, 2021Updated 5 years ago
MobileLLM / DroidBot-GPT
View on GitHub
Automating Android apps with ChatGPT-like LLM.
☆157Jan 17, 2024Updated 2 years ago
chuyg1005 / seeclick-crawler
View on GitHub
☆20Apr 24, 2024Updated 2 years ago
bit-ml / VeriDark
View on GitHub
Dark Web Authorship Verification Dataset
☆16May 15, 2023Updated 3 years ago
google-research-datasets / rico_semantics
View on GitHub
Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations b…
☆36Jun 27, 2024Updated 2 years ago
coinse / droidagent
View on GitHub
DroidAgent: Intent-Driven Mobile GUI Testing with Autonomous LLM Agents
☆71Mar 12, 2024Updated 2 years ago
MulongXie / UIED
View on GitHub
An accurate GUI element detection approach based on old-fashioned CV algorithms [Upgraded on 5/July/2021]
☆549Nov 8, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
SalesforceAIResearch / swecomm
View on GitHub
☆28Jun 2, 2026Updated last month
yzygitzh / Humanoid
View on GitHub
Explore Android apps like human.
☆134Feb 18, 2023Updated 3 years ago
yasumasaonoe / ecbd
View on GitHub
☆11Apr 23, 2023Updated 3 years ago
RUCBM / GUICourse
View on GitHub
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆143Mar 1, 2026Updated 4 months ago
Mister-iks / ai_suggest_deployment
View on GitHub
AI SUGGEST is a powerful command-line assistant that leverages AI to provide accurate Linux commands based on natural language queries. S…
☆11Aug 22, 2024Updated last year
showlab / assistgui
View on GitHub
☆30Apr 16, 2024Updated 2 years ago
showlab / Awesome-GUI-Agent
View on GitHub
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
☆1,199Aug 17, 2025Updated 11 months ago