xiaomi-research / guievalkitView external linksLinks
GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents
☆19Jan 26, 2026Updated 3 weeks ago
Alternatives and similar repositories for guievalkit
Users that are interested in guievalkit are comparing it to the libraries listed below
Sorting:
- [ICCV 2025 Highlight] Less is More: Empowering GUI Agent with Context-Aware Simplification☆41Jan 21, 2026Updated 3 weeks ago
- ☆62Sep 6, 2025Updated 5 months ago
- Latest Papers, Codes and Datasets on VTG-LLMs.☆80Nov 17, 2025Updated 2 months ago
- Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"☆10Aug 7, 2023Updated 2 years ago
- ☆11Jun 3, 2024Updated last year
- A mobile GUI search engine using a vision-language model☆14May 5, 2025Updated 9 months ago
- [AAAI2025 Oral] BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking☆12Apr 22, 2025Updated 9 months ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- ☆17Oct 30, 2023Updated 2 years ago
- ☆30Updated this week
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 6 months ago
- LaTeX Proposal Template for the University of Chinese Academy of Sciences☆18Oct 14, 2023Updated 2 years ago
- [NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions☆13May 7, 2024Updated last year
- SCU Virtual Judge☆11Feb 16, 2023Updated 3 years ago
- The code for HerO: a fact-checking pipeline based on open LLMs (the runner-up in AVeriTeC)☆13Mar 18, 2025Updated 10 months ago
- Open source code for paper "Learning World Models with Identifiable Factorization"☆13Mar 4, 2024Updated last year
- ☆28Jun 20, 2025Updated 7 months ago
- Official implementation of "TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization" (Findings of ACL …☆21Jul 25, 2025Updated 6 months ago
- Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data.☆30Nov 19, 2025Updated 2 months ago
- ☆71Nov 27, 2024Updated last year
- ☆19Mar 26, 2025Updated 10 months ago
- A simple visual test-time scaling method for GUI agent grounding☆20Dec 7, 2025Updated 2 months ago
- Self-Supervised Dataset Distillation for Transfer Learning☆16Apr 10, 2024Updated last year
- ☆11Dec 4, 2023Updated 2 years ago
- [WWW2024 Oral] Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering☆15Apr 22, 2025Updated 9 months ago
- Android load custom dex file☆12Aug 16, 2014Updated 11 years ago
- ☆17Mar 22, 2024Updated last year
- AI Video Translator / it uses ai to transcribe, translate and then reVoice a video into english in the original speakers voice☆18Jun 21, 2023Updated 2 years ago
- 🎭 Explore the magic of face swapping with "Painted Skin"! 🎭 Upload a source picture or video, select a target picture, and let the fun …☆19Nov 9, 2023Updated 2 years ago
- ☆17Oct 31, 2023Updated 2 years ago
- [experimental] FFMPEG hardware encoder for Android☆15Sep 8, 2020Updated 5 years ago
- 斯坦福吴恩达教授在Coursera上的机器学习课程的课件,作业及鄙人的笔记。☆15Oct 18, 2017Updated 8 years ago
- 记录日常的读书笔记☆17Mar 29, 2021Updated 4 years ago
- ☆22May 23, 2025Updated 8 months ago
- A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases☆14Mar 22, 2022Updated 3 years ago
- ☆22Jul 16, 2024Updated last year
- Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents☆27Dec 6, 2024Updated last year
- About Data and Codes for EMNLP 2023 System Demo Paper "QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking"☆19Dec 19, 2023Updated 2 years ago
- Official PyTorch implementation of "Multisize Dataset Condensation" (ICLR'24 Oral)☆15Apr 18, 2024Updated last year