aialt / awesome-mobile-agents
✨✨Latest Papers and Datasets on Mobile and PC GUI Agent
☆73Updated 2 weeks ago
Alternatives and similar repositories for awesome-mobile-agents:
Users that are interested in awesome-mobile-agents are comparing it to the libraries listed below
- GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes fr…☆76Updated last month
- ☆25Updated 2 months ago
- ☆81Updated last week
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆64Updated 2 weeks ago
- Towards Large Multimodal Models as Visual Foundation Agents☆142Updated 3 weeks ago
- The model, data and code for the visual GUI Agent SeeClick☆248Updated 3 weeks ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆84Updated last month
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆92Updated 5 months ago
- [NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of…☆105Updated 3 weeks ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆49Updated 8 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆128Updated this week
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆60Updated 2 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆121Updated 3 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆48Updated last month
- An Easy-to-use Hallucination Detection Framework for LLMs.☆48Updated 7 months ago
- A Self-Training Framework for Vision-Language Reasoning☆40Updated last month
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆39Updated 5 months ago
- ☆174Updated 3 weeks ago
- A Survey on Benchmarks of Multimodal Large Language Models☆72Updated 2 months ago
- MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.☆73Updated 2 months ago
- RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness☆254Updated last week
- ☆34Updated 2 months ago
- AI Alignment: A Comprehensive Survey☆131Updated last year
- ☆97Updated 4 months ago
- Reformatted Alignment☆113Updated 2 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆201Updated 2 months ago
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆186Updated 2 months ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆59Updated 2 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆151Updated last week
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆73Updated last month