Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation; CoLLAs 2025)
☆35Jul 21, 2025Updated 7 months ago
Alternatives and similar repositories for b-moca
Users that are interested in b-moca are comparing it to the libraries listed below
Sorting:
- Evaluating Safety of Autonomous Agents in Mobile Device Control (AAAI 2026 AI Alignment Track)☆32Jan 28, 2026Updated last month
- ☆31Sep 27, 2024Updated last year
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆67Aug 9, 2024Updated last year
- ☆44Apr 11, 2024Updated last year
- Track and Collaborate on ML & AI Experiments.☆44Mar 10, 2025Updated 11 months ago
- (ICLR 2025) The Official Code Repository for GUI-World.☆68Dec 18, 2024Updated last year
- AndroidWorld is an environment and benchmark for autonomous agents☆640Feb 24, 2026Updated last week
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 8 months ago
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆24Jun 17, 2025Updated 8 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆256Apr 24, 2025Updated 10 months ago
- Rust implementation of Surya☆65Mar 1, 2025Updated last year
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…☆147Jan 3, 2026Updated 2 months ago
- ☆38Jun 16, 2024Updated last year
- [ICSE'24] Latest 7,796 active and unique ransomware samples from 95 families. Code is in another repository. 勒索软件数据破坏攻击分析☆32Mar 1, 2024Updated 2 years ago
- Incredibly descriptive audiovisual summaries for videos☆41Aug 2, 2024Updated last year
- ☆22May 23, 2025Updated 9 months ago
- ☆44Mar 19, 2024Updated last year
- Utilities for efficient fine-tuning, inference and evaluation of code generation models☆21Oct 3, 2023Updated 2 years ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆99Oct 8, 2024Updated last year
- Convolutional Interactive Artificial Neural Networks by/for Astrophysicists☆53Feb 21, 2026Updated last week
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆99Oct 14, 2024Updated last year
- Develop, evaluate and monitor LLM applications at scale☆100Nov 29, 2024Updated last year
- ☆96Mar 26, 2024Updated last year
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆28Jul 31, 2024Updated last year
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆64Oct 19, 2024Updated last year
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated last year
- ☆35Jan 12, 2026Updated last month
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- MaxMSP TUIO 1.1 client☆10Jun 16, 2016Updated 9 years ago
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.