xiaomi-research/guievalkit

[ICML 2026] GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents

☆24

Alternatives and similar repositories for guievalkit

Users who are interested in guievalkit are comparing it to the libraries listed below.

Darwin-Agent / awesome-world-models-for-digital-agents
Digital Agents Meet World Models: A Survey
☆50 · May 8, 2026 · Updated 2 months ago
StarWalkin / UI-NEXUS
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14 · Jul 27, 2025 · Updated last year
SceneDroid / SceneDroid
☆17 · Oct 30, 2023 · Updated 2 years ago
lgy0404 / MemGUI-Bench
[ACM MM 2026] MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments
☆46 · Jul 13, 2026 · Updated 2 weeks ago
Jl-wei / guing
A mobile GUI search engine using a vision-language model
☆15 · May 5, 2025 · Updated last year
tiangeluo / RegionFocus
A simple visual test-time scaling method for GUI agent grounding
☆26 · Dec 7, 2025 · Updated 7 months ago
XiaoMi / DetermLR
Open source code for paper
☆14 · May 27, 2024 · Updated 2 years ago
UITron-hub / UItron
☆67 · Sep 6, 2025 · Updated 10 months ago
OSU-NLP-Group / GUI-Agents-Paper-List
Awesome GUI Agent Paper List
☆865 · Jun 28, 2026 · Updated last month
gpengzhi / CrossConST-MT
Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …
☆10 · Jul 18, 2023 · Updated 3 years ago
MobileLLM / LLM-Explorer
☆25 · Jun 1, 2026 · Updated last month
InfiXAI / InfiGUI-R1
Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"
☆67 · Dec 4, 2025 · Updated 7 months ago
iLearn-Lab / ACL25-GUI-explorer
[ACL 2025] GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent
☆68 · May 28, 2025 · Updated last year
ZJU-REAL / Awesome-GUI-Agents
A curated collection of resources, tools, and frameworks for developing GUI Agents.
☆446 · Updated this week
ritzz-ai / GUI-R1
Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents
☆252 · May 5, 2025 · Updated last year
wmt-conference / wmt23-news-systems
☆14 · Oct 6, 2025 · Updated 9 months ago
akhtarnabeel / COSE-Serverless-Configuration
COSE: Configuring Serverless Functions using Statistical Learning
☆10 · Jun 28, 2023 · Updated 3 years ago
X-LANCE / text2sql-multiturn-GPT
[NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
☆13 · May 7, 2024 · Updated 2 years ago
GoodwillComputingLab / CLITE
☆10 · Mar 14, 2020 · Updated 6 years ago
JiuTian-VL / SimpAgent
[ICCV 2025 Highlight] Less is More: Empowering GUI Agent with Context-Aware Simplification
☆48 · Mar 12, 2026 · Updated 4 months ago
OpenGVLab / GUI-Odyssey
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159 · Jan 3, 2026 · Updated 6 months ago
alibaba / MobiZen-GUI
☆46 · Mar 25, 2026 · Updated 4 months ago
cdxeve / awesome-computer-use-agents
A curated list of papers, tools, and benchmarks on LLM-based computer-use agents, covering both terminal/CLI and GUI approaches.
☆16 · May 21, 2026 · Updated 2 months ago
Trae1ounG / BuPO
[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
☆60 · Feb 6, 2026 · Updated 5 months ago
ZJU-ACES-ISE / ChatUITest
Under construction
☆14 · Jan 15, 2025 · Updated last year
db-Lee / selfsup_dd
Self-Supervised Dataset Distillation for Transfer Learning
☆19 · Apr 10, 2024 · Updated 2 years ago
chaojin0310 / Ditto
Artifacts for our SIGCOMM'23 paper Ditto
☆15 · Oct 17, 2023 · Updated 2 years ago
597358816 / AEPO
Arbitrary Entropy Policy Optimization: Entropy Is Controllable in Reinforcement Fine-tuning
☆17 · Jan 19, 2026 · Updated 6 months ago
portals-project / portals
Portals is a framework for stateful serverless apps, unifying dataflow streaming with actors
☆20 · Nov 15, 2023 · Updated 2 years ago
COS-IN / iluvatar-faas
Ilúvatar is an open Serverless platform built with the goal of jumpstarting and streamlining FaaS research. It provides a system that is …
☆26 · Jul 19, 2026 · Updated last week
xiaomi-research / btl-ui
[NeurIPS 2025] Implementation of the paper "BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent"
☆19 · Nov 27, 2025 · Updated 8 months ago
he-y / Multisize-Dataset-Condensation
Official PyTorch implementation of "Multisize Dataset Condensation" (ICLR'24 Oral)
☆17 · Apr 18, 2024 · Updated 2 years ago
wwfnb / Laser
☆16 · Sep 16, 2025 · Updated 10 months ago
GSR-SQL / GSR
LLM Prompting for Text2SQL via Gradual SQL Refinement
☆15 · Feb 19, 2025 · Updated last year
Kuberboat / Kuberboat
A system which deploys and manages containerized applications. Course project of SJTU SE3356, 2022.
☆16 · Jun 29, 2022 · Updated 4 years ago
EthanLeo-LYX / LLMQA
[WWW2024 Oral] Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering
☆15 · Apr 22, 2025 · Updated last year
MichSchli / AVeriTeC
☆75 · Nov 27, 2024 · Updated last year
deerishi / graph-based-semi-supervised-learning
This project explores the different techniques (both scalable and non scalable) for Graph based semi supervised learning. Recent techniqu…
☆14 · May 28, 2016 · Updated 10 years ago
AlexWanghaoming / CBPR
[NeurIPS 2024] Beyond Single Stationary Policies: Meta-Task Players as Naturally Superior Collaborators
☆16 · Nov 15, 2024 · Updated last year