THUDM / Android-Lab
☆128 Updated last week
Related projects
Alternatives and complementary repositories for Android-Lab
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆166 Updated this week
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale" ☆190 Updated 3 weeks ago
- ☆212 Updated 3 months ago
- ☆283 Updated last month
- ☆116 Updated 5 months ago
- Reformatted Alignment ☆112 Updated last month
- Towards Large Multimodal Models as Visual Foundation Agents ☆113 Updated last week
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning. ☆254 Updated last month
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs ☆216 Updated 6 months ago
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning ☆177 Updated last month
- This is a collection of resources for computer-use agents, including videos, blogs, papers, and projects. ☆85 Updated this week
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024) ☆196 Updated 3 months ago
- The model, data and code for the visual GUI Agent SeeClick ☆216 Updated 2 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances. ☆138 Updated last year
- FireAct: Toward Language Agent Fine-tuning ☆254 Updated last year
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e… ☆346 Updated 2 months ago
- GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes fr… ☆64 Updated 4 months ago
- ☆192 Updated 6 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step ☆230 Updated 7 months ago
- 🤠 Agent-as-a-Judge and DevAI dataset ☆184 Updated last week
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models ☆80 Updated 7 months ago
- 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents. ☆178 Updated 2 weeks ago
- ☆89 Updated 7 months ago
- A Comprehensive Benchmark for Software Development. ☆84 Updated 5 months ago
- ☆48 Updated 8 months ago
- AndroidWorld is an environment and benchmark for autonomous agents ☆125 Updated this week
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents ☆133 Updated this week
- ☆67 Updated 2 weeks ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models ☆166 Updated last month
- ☆129 Updated 6 months ago