ranpox / awesome-computer-use
This is a collection of resources for computer-use agents, including videos, blogs, papers, and projects.
☆85Updated this week
Related projects
Alternatives and complementary repositories for awesome-computer-use
- Reformatted Alignment☆112Updated last month
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆190Updated 3 weeks ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆166Updated this week
- ☆48Updated 8 months ago
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆177Updated last month
- Environments, tools, and benchmarks for general computer agents☆172Updated 2 weeks ago
- Official Repo for UGround☆93Updated this week
- This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"☆85Updated last month
- ☆116Updated 5 months ago
- ☆128Updated last week
- Code for the paper 🌳 Tree Search for Language Model Agents☆138Updated 3 months ago
- Expert Specialized Fine-Tuning☆144Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆75Updated 3 weeks ago
- ☆83Updated 7 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆46Updated last month
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆44Updated this week
- ☆283Updated last month
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆146Updated this week
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆106Updated 2 weeks ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆96Updated last week
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆107Updated 2 months ago
- FireAct: Toward Language Agent Fine-tuning☆254Updated last year
- ☆78Updated 6 months ago
- ☆190Updated 2 months ago
- AndroidWorld is an environment and benchmark for autonomous agents☆127Updated this week
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆62Updated 7 months ago
- Trending projects and awesome papers about data-centric LLM studies.☆32Updated this week
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆47Updated 5 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆114Updated this week
- 🤠 Agent-as-a-Judge and DevAI dataset☆185Updated last week