Natural language browser automation
☆627Dec 21, 2024Updated last year
Alternatives and similar repositories for browserpilot
Users that are interested in browserpilot are comparing it to the libraries listed below
Sorting:
- Create browser automation as if you were teaching a human using GPT-4 Vision.☆587Feb 19, 2024Updated 2 years ago
- Large Action Model framework to develop AI Web Agents☆6,311Jan 21, 2025Updated last year
- Automate browser based workflows with AI☆20,530Updated this week
- Automate your browser with GPT-4☆1,263Jan 15, 2025Updated last year
- Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs)…☆1,491Feb 18, 2026Updated last week
- An AutoGPT agent that controls Chrome on your desktop☆1,746Oct 25, 2023Updated 2 years ago
- 🐣🕐📅 A simple utility to draft scheduling emails.☆12Sep 13, 2023Updated 2 years ago
- Vision utilities for web interaction agents 👀☆1,753Nov 25, 2024Updated last year
- Drive a browser with GPT-3☆1,935Jun 9, 2024Updated last year
- A Flask Server Demo Application showing off some llama-index LLM prompt magic, including file upload and parsing :)☆22Mar 1, 2023Updated 2 years ago
- Let LLMs manage your local dev environments☆27Jan 7, 2025Updated last year
- gpt + aria = ability to read browser contents☆73Jun 21, 2023Updated 2 years ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆160Feb 11, 2025Updated last year
- AutoBrowse is an autonomous AI agent that can perform web browsing tasks.☆94Nov 14, 2023Updated 2 years ago
- An app to organize your research: A Paper Based Approach☆22Feb 26, 2023Updated 3 years ago
- Command your browser with GPT☆421Feb 3, 2026Updated 3 weeks ago
- The AI Browser Automation Framework☆21,261Updated this week
- Ask questions against any git repository, and get a response from OpenAI GPT-3 model.☆77Aug 15, 2023Updated 2 years ago
- Plan-Validate-Solve (PVS) Agent for accurate, reliable and reproducable workflow automation☆349Sep 25, 2023Updated 2 years ago
- Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api☆1,219Jun 3, 2025Updated 8 months ago
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI☆1,066Dec 9, 2024Updated last year
- The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs.☆4,547Feb 18, 2026Updated last week
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app …☆6,415Feb 3, 2026Updated 3 weeks ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,337Nov 26, 2025Updated 3 months ago
- chatbot does what you ask, like open Google search, post a Tweet, etc.☆330Jul 2, 2025Updated 7 months ago
- Python package for webscraping in Natural language☆149Oct 28, 2023Updated 2 years ago
- GPT-powered bot that can automate complex online tasks using both the web browser and API calls.☆172Apr 3, 2023Updated 2 years ago
- 🌟BuroTonic: Revolutionizing Sales and Marketing with AI-Driven Business Intelligence - An adaptive team of virtual agents for precise cl…☆23Jul 19, 2025Updated 7 months ago
- an ambient intelligence library☆6,083Feb 19, 2026Updated last week
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆947Nov 5, 2025Updated 3 months ago
- Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"☆1,028Mar 4, 2024Updated last year
- structured outputs for llms☆12,428Updated this week
- Turn any webpage into structured data using LLMs☆6,195Feb 7, 2026Updated 3 weeks ago
- AI leetcode interviewer that assesses tech applicants. Built on Langchain and OpenAI APIs. Recruiter-focused and tracks progress and subm…☆15Jun 6, 2023Updated 2 years ago
- ☆14Mar 28, 2024Updated last year
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Updated this week
- A framework to enable multimodal models to operate a computer.☆10,158Sep 19, 2025Updated 5 months ago
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild☆4,710Nov 18, 2024Updated last year