Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation; CoLLAs 2025)
☆35Jul 21, 2025Updated 8 months ago
Alternatives and similar repositories for b-moca
Users that are interested in b-moca are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluating Safety of Autonomous Agents in Mobile Device Control (AAAI 2026 AI Alignment Track)☆32Jan 28, 2026Updated last month
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆67Aug 9, 2024Updated last year
- AndroidWorld is an environment and benchmark for autonomous agents☆679Updated this week
- Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026)☆55Feb 23, 2026Updated last month
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆25Jun 17, 2025Updated 9 months ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆392Feb 22, 2025Updated last year
- DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing (ICLR 2025)☆44May 18, 2025Updated 10 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆259Apr 24, 2025Updated 11 months ago
- ☆22May 23, 2025Updated 10 months ago
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆137Mar 1, 2026Updated 3 weeks ago
- ☆35Jan 12, 2026Updated 2 months ago
- This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…☆34Aug 20, 2020Updated 5 years ago
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…☆149Jan 3, 2026Updated 2 months ago
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆29Jul 31, 2024Updated last year
- Official Repository for Can Language Models be Instructed to Protect Personal Information?☆13Oct 8, 2023Updated 2 years ago
- Rust implementation of Surya☆66Mar 1, 2025Updated last year
- ReasoningShield: Safety Detection over Reasoning Traces of Large Reasoning Models☆25Sep 27, 2025Updated 5 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 9 months ago
- ☆24Jan 19, 2026Updated 2 months ago
- One-to-many Approach for Improving Super-Resolution implemented in Tensorflow 2.x☆16May 15, 2022Updated 3 years ago
- ☆51Dec 31, 2024Updated last year
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆18Nov 24, 2025Updated 4 months ago
- ☆15Apr 8, 2022Updated 3 years ago
- ☆14May 1, 2023Updated 2 years ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆100Oct 14, 2024Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆148Nov 26, 2024Updated last year
- Implementation of the paper 'Improve Discourse Dependency Parsing with Contextualized Representations', Findings of NAACL 2022☆14Jul 15, 2022Updated 3 years ago
- Incredibly descriptive audiovisual summaries for videos☆41Aug 2, 2024Updated last year
- Code for Paper "Towards More Generalizable One-Shot Visual Imitation Learning", ICRA 2022☆20May 5, 2022Updated 3 years ago
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- DroidAgent: Intent-Driven Mobile GUI Testing with Autonomous LLM Agents☆60Mar 12, 2024Updated 2 years ago
- Supercharge huggingface transformers with model parallelism.☆78Jul 23, 2025Updated 8 months ago
- ☆41Dec 9, 2025Updated 3 months ago
- Official implementation of Tabular Transfer Learning via Prompting LLMs (COLM 2024).☆13Aug 6, 2024Updated last year
- Unrailed! simulator using C++ with some reinforcement learning and Unrailed! AI using Python with OpenCV☆17Dec 6, 2021Updated 4 years ago
- ☆307Aug 18, 2025Updated 7 months ago
- Code for Paper "Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation"☆12Feb 6, 2023Updated 3 years ago
- [TMLR] LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects☆162Dec 2, 2025Updated 3 months ago