THUDM / Android-Lab
☆156Updated this week
Related projects ⓘ
Alternatives and complementary repositories for Android-Lab
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆213Updated last week
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆193Updated last month
- The model, data and code for the visual GUI Agent SeeClick☆227Updated this week
- ☆287Updated 2 months ago
- ☆222Updated 3 months ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆198Updated 4 months ago
- ☆194Updated 7 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆221Updated 7 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆173Updated this week
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆115Updated 2 months ago
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆179Updated last month
- This is a collection of resources for computer-use agents, including videos, blogs, papers, and projects.☆105Updated 2 weeks ago
- Towards Large Multimodal Models as Visual Foundation Agents☆123Updated last week
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"☆358Updated last month
- ☆116Updated 5 months ago
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e…☆354Updated 2 months ago
- LongQLoRA: Extent Context Length of LLMs Efficiently☆159Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆156Updated 7 months ago
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆84Updated 4 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆231Updated 7 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆167Updated last month
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆139Updated last year
- Environments, tools, and benchmarks for general computer agents☆172Updated last month
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆220Updated 3 weeks ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆325Updated last month
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆265Updated last month
- RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness☆246Updated 2 weeks ago
- Reformatted Alignment☆112Updated 2 months ago
- GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes fr…☆69Updated last week
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆66Updated this week