showlab / computer_use_ootb
An out-of-the-box (OOTB) version of Anthropic Claude Computer Use for Windows and macOS
☆345 Updated this week
Related projects
Alternatives and complementary repositories for computer_use_ootb
- Agent S: an open agentic framework that uses computers like a human ☆606 Updated this week
- Agent Q: OSS advanced reasoning and learning for autonomous AI agents ☆350 Updated last month
- Vision-Augmented Retrieval and Generation (VARAG): vision-first RAG engine ☆356 Updated last month
- Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking multi-modal AI agents. ☆483 Updated this week
- Code for "WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models" ☆342 Updated 8 months ago
- ☆407 Updated last month
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult… ☆647 Updated last week
- Flexible and powerful multi-agent AI framework ☆313 Updated 2 weeks ago
- OS-ATLAS: A Foundation Action Model for Generalist GUI Agents ☆166 Updated this week
- Examples of using E2B ☆738 Updated this week
- Desktop app powered by Claude’s computer use capability to control your computer ☆259 Updated 2 weeks ago
- AWM: Agent Workflow Memory ☆205 Updated last month
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o… ☆314 Updated last month
- Agent-driven automation, starting with the web. Discord: https://discord.gg/wgNfmFuqJF ☆818 Updated this week
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆204 Updated this week
- The framework/SDK that lets you build browser-controlling agents in 3 lines of code. Join the chat at https://discord.gg/umgnyQU2K8 ☆453 Updated last month
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆535 Updated 3 weeks ago
- podcastfy.ai Gradio demo app ☆311 Updated this week
- CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/ ☆191 Updated last week
- Official repo for the ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan… ☆496 Updated 5 months ago
- macOS demo for Claude Computer Use ☆121 Updated 3 weeks ago
- Automated Design of Agentic Systems ☆1,038 Updated this week
- OpenResearcher, an advanced scientific research assistant ☆408 Updated last month
- ☆119 Updated 3 weeks ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr… ☆168 Updated 2 weeks ago
- 🤠 Agent-as-a-Judge and DevAI dataset ☆192 Updated this week
- ☆294 Updated 5 months ago
- The fastest way to build robust AI agents ☆434 Updated this week
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆328 Updated 5 months ago