convergence-ai / webgamesLinks
Challenges for general-purpose web-browsing AI agents
☆58Updated 3 weeks ago
Alternatives and similar repositories for webgames
Users that are interested in webgames are comparing it to the libraries listed below
Sorting:
- Code for ScribeAgent paper☆58Updated 3 months ago
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!☆46Updated 2 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆42Updated this week
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆57Updated 6 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆68Updated 3 months ago
- ☆50Updated 3 weeks ago
- AGI SDK☆60Updated last week
- Agent computer interface for AI software engineer.☆85Updated this week
- ☆11Updated 11 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆57Updated 2 months ago
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆79Updated last week
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆48Updated this week
- ☆63Updated last month
- accompanying material for sleep-time compute paper☆93Updated last month
- ☆156Updated 3 months ago
- LLM reads a paper and produce a working prototype☆57Updated 2 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆80Updated 2 weeks ago
- 👷♂️Minion is Agent's Brain. Minion is designed to execute any type of queries, offering a variety of features that demonstrate its flex…☆22Updated last week
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆150Updated 4 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- ☆40Updated 11 months ago
- ☆115Updated 4 months ago
- Open Agent Computer Interface☆73Updated 6 months ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!☆79Updated 2 weeks ago
- Very minimal (and stateless) agent framework☆44Updated 5 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 5 months ago
- HUD SDK☆64Updated last week
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.☆84Updated 2 months ago
- Multimodal computer agent data collection program☆133Updated last year