GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents
☆19Feb 26, 2026Updated last month
Alternatives and similar repositories for guievalkit
Users that are interested in guievalkit are comparing it to the libraries listed below.
- [ICCV 2025 Highlight] Less is More: Empowering GUI Agent with Context-Aware Simplification☆44Mar 12, 2026Updated 2 weeks ago
- ☆66Sep 6, 2025Updated 6 months ago
- Latest Papers, Codes and Datasets on VTG-LLMs.☆86Nov 17, 2025Updated 4 months ago
- Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents☆27Dec 6, 2024Updated last year
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"☆11Aug 7, 2023Updated 2 years ago
- Official implementation of "TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization" (Findings of ACL …☆21Jul 25, 2025Updated 8 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 8 months ago
- [NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions☆13May 7, 2024Updated last year
- [CVPR 2026] HiconAgent: History Context-aware Policy Optimization for GUI Agents☆27Mar 9, 2026Updated 2 weeks ago
- ☆17Oct 30, 2023Updated 2 years ago
- A simple visual test-time scaling method for GUI agent grounding☆21Dec 7, 2025Updated 3 months ago
- Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data.☆31Nov 19, 2025Updated 4 months ago
- Self-Supervised Dataset Distillation for Transfer Learning☆18Apr 10, 2024Updated last year
- Open source code for paper "Learning World Models with Identifiable Factorization"☆13Mar 4, 2024Updated 2 years ago
- Notes from everyday reading☆17Mar 29, 2021Updated 5 years ago
- The code for HerO: a fact-checking pipeline based on open LLMs (the runner-up in AVeriTeC)☆15Mar 18, 2025Updated last year
- A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases☆14Mar 22, 2022Updated 4 years ago
- LaTeX Proposal Template for the University of Chinese Academy of Sciences☆18Oct 14, 2023Updated 2 years ago
- Official PyTorch implementation of "Multisize Dataset Condensation" (ICLR'24 Oral)☆15Apr 18, 2024Updated last year
- ☆21Mar 26, 2025Updated last year
- [AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615☆62Nov 8, 2025Updated 4 months ago
- AI Video Translator / it uses ai to transcribe, translate and then reVoice a video into english in the original speakers voice☆19Jun 21, 2023Updated 2 years ago
- Tensorflow implementation of the `intelligent synapse' model from [Zenke et al., (2017)] and application to the Permuted MNIST benchmark.☆22Aug 2, 2017Updated 8 years ago
- ☆73Nov 27, 2024Updated last year
- [AAAI2025 Oral] BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking☆13Apr 22, 2025Updated 11 months ago
- ☆17Oct 31, 2023Updated 2 years ago
- [ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"☆71Oct 25, 2025Updated 5 months ago
- TreeSearch: Structure-aware document retrieval without embeddings. Searches tens of thousands of documents and large codebases in milliseconds while preserving document structure.☆103Updated this week
- ☆11Dec 4, 2023Updated 2 years ago
- collection for the common dataset in my research☆32Feb 23, 2020Updated 6 years ago
- About Data and Codes for EMNLP 2023 System Demo Paper "QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking"☆19Dec 19, 2023Updated 2 years ago
- ☆23Jul 16, 2024Updated last year
- Use this template to create the repo for your osmos::feed reader.☆27Feb 14, 2022Updated 4 years ago
- 🎭 Explore the magic of face swapping with "Painted Skin"! 🎭 Upload a source picture or video, select a target picture, and let the fun …☆19Nov 9, 2023Updated 2 years ago
- The source code of Paper "PathQG: Neural Question Generation from Facts".☆23Jan 4, 2021Updated 5 years ago
- Large Language Models(LLMs) of Code☆20Apr 23, 2023Updated 2 years ago
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆22Mar 5, 2026Updated 3 weeks ago
- ☆17Mar 22, 2024Updated 2 years ago