GAIR-NLP / PC-Agent
PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World
☆172Updated last month
Alternatives and similar repositories for PC-Agent:
Users who are interested in PC-Agent are comparing it to the repositories listed below
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆121Updated last month
- ☆184Updated 2 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆210Updated 2 weeks ago
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆86Updated this week
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆129Updated 3 weeks ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆69Updated 3 months ago
- connecting humans and agents☆67Updated last month
- Towards Large Multimodal Models as Visual Foundation Agents☆167Updated last month
- GitHub page for "Large Language Model-Brained GUI Agents: A Survey"☆102Updated last week
- A series of technical reports on Slow Thinking with LLM☆359Updated this week
- Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆185Updated 2 weeks ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆144Updated this week
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆191Updated 3 months ago
- GUICourse: From General Vision Language Models to Versatile GUI Agents☆96Updated 6 months ago
- ☆208Updated 9 months ago
- ☆78Updated 2 months ago
- ✨✨Latest Papers and Datasets on Mobile and PC GUI Agent☆95Updated 2 months ago
- ☆87Updated 10 months ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆298Updated 2 months ago
- AndroidWorld is an environment and benchmark for autonomous agents☆189Updated last week
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆60Updated 4 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆104Updated last month
- ☆38Updated 2 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆282Updated this week
- UGround: Universal GUI Visual Grounding for GUI Agents☆147Updated this week
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆209Updated 3 months ago
- The model, data and code for the visual GUI Agent SeeClick☆294Updated 2 months ago
- ☆163Updated 9 months ago
- ☆164Updated last month
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆203Updated 2 weeks ago