invariantlabs-ai / playwright-computer-use
Let Claude control a web browser on your machine.
☆26Updated last month
Alternatives and similar repositories for playwright-computer-use:
Users that are interested in playwright-computer-use are comparing it to the libraries listed below
- A better way of testing, inspecting, and analyzing AI Agent traces.☆35Updated last week
- OpenPipe ART (Agent Reinforcement Trainer): train LLM agents☆108Updated this week
- Verdict is a library for scaling judge-time compute.☆199Updated last week
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 3 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆50Updated last month
- Sphynx Hallucination Induction☆53Updated 2 months ago
- Model Context Protocol (MCP) Server for Langfuse Prompt Management. This server allows you to access and manage your Langfuse prompts thr…☆57Updated 2 months ago
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆44Updated last week
- Browser extension to enable MCP in claude.ai☆45Updated last week
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated 9 months ago
- MCP to explore websites with llms.txt files☆33Updated last month
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, …☆27Updated 8 months ago
- Agent computer interface for AI software engineer.☆63Updated this week
- ☆37Updated 2 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 5 months ago
- Letting Claude Code develop his own MCP tools :)☆99Updated last month
- ☆50Updated 5 months ago
- Challenges for general-purpose web-browsing AI agents☆46Updated last month
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 6 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- ☆38Updated 2 weeks ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆45Updated 2 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated 11 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆77Updated last month
- Code for ScribeAgent paper☆56Updated last month
- ☆47Updated last year
- Simple demo showing how to use the Forge API by Nous Research☆11Updated 5 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆25Updated 3 months ago
- Code interpreter support for o1☆32Updated 7 months ago
- Structured outputs from DSPy and Jinja2☆23Updated 4 months ago