Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation; CoLLAs 2025)
☆35Jul 21, 2025Updated 8 months ago
Alternatives and similar repositories for b-moca
Users that are interested in b-moca are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluating Safety of Autonomous Agents in Mobile Device Control (AAAI 2026 AI Alignment Track)☆33Jan 28, 2026Updated 2 months ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- ☆46Apr 11, 2024Updated 2 years ago
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆67Aug 9, 2024Updated last year
- ☆32Sep 27, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- AndroidWorld is an environment and benchmark for autonomous agents☆712Updated this week
- Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026 Highlight)☆57Feb 23, 2026Updated last month
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆25Jun 17, 2025Updated 9 months ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆394Feb 22, 2025Updated last year
- (ICLR 2025) The Official Code Repository for GUI-World.☆69Dec 18, 2024Updated last year
- A tiny reinforcement learning codebase for continuous control, built on top of JAX.☆15Mar 28, 2023Updated 3 years ago
- Towards Large Multimodal Models as Visual Foundation Agents☆263Apr 24, 2025Updated 11 months ago
- ☆22May 23, 2025Updated 10 months ago
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆141Mar 1, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…☆34Aug 20, 2020Updated 5 years ago
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…☆153Jan 3, 2026Updated 3 months ago
- ☆45Mar 19, 2024Updated 2 years ago
- Track and Collaborate on ML & AI Experiments.☆44Mar 10, 2025Updated last year
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆29Jul 31, 2024Updated last year
- ☆15Jul 6, 2022Updated 3 years ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆65Oct 19, 2024Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 10 months ago
- ReasoningShield: Safety Detection over Reasoning Traces of Large Reasoning Models☆26Sep 27, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆23Updated this week
- ☆52Dec 31, 2024Updated last year
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆18Nov 24, 2025Updated 4 months ago
- ☆14May 1, 2023Updated 2 years ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆149Nov 26, 2024Updated last year
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆101Oct 14, 2024Updated last year
- Environments, tools, and benchmarks for general computer agents☆14Dec 3, 2024Updated last year
- Generic EKF, with support for non-Euclidean manifolds☆25Apr 6, 2022Updated 4 years ago
- Code for 'Mapping State Space using Landmarks for Universal Goal Reaching'.☆16Dec 26, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Deep-RL algorithm Implementations using Pytorch☆16Jun 2, 2023Updated 2 years ago
- Instruction Following Agents with Multimodal Transforemrs☆54Nov 3, 2022Updated 3 years ago
- Code for Paper "Towards More Generalizable One-Shot Visual Imitation Learning", ICRA 2022☆20May 5, 2022Updated 3 years ago
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- Unrailed! simulator using C++ with some reinforcement learning and Unrailed! AI using Python with OpenCV☆17Dec 6, 2021Updated 4 years ago
- Official implementation of Tabular Transfer Learning via Prompting LLMs (COLM 2024).☆13Aug 6, 2024Updated last year
- Large-Vocabulary Continuous Sign Language Recognition, 2024☆16May 30, 2024Updated last year