Command your browser with GPT
☆421 · Feb 3, 2026 · Updated last month
Alternatives and similar repositories for BrowserGPT
Users that are interested in BrowserGPT are comparing it to the libraries listed below
- Create browser automation as if you were teaching a human using GPT-4 Vision. ☆586 · Feb 19, 2024 · Updated 2 years ago
- Automate your browser with GPT-4 ☆1,268 · Jan 15, 2025 · Updated last year
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI ☆1,065 · Dec 9, 2024 · Updated last year
- gpt + aria = ability to read browser contents ☆73 · Jun 21, 2023 · Updated 2 years ago
- A chrome extension which looks up selected text via ChatGPT using your custom prompts ☆25 · Jan 26, 2025 · Updated last year
- An AutoGPT agent that controls Chrome on your desktop ☆1,745 · Oct 25, 2023 · Updated 2 years ago
- GPT-powered bot that can automate complex online tasks using both the web browser and API calls. ☆172 · Apr 3, 2023 · Updated 2 years ago
- Natural language browser automation ☆628 · Dec 21, 2024 · Updated last year
- Drive a browser with GPT-3 ☆1,936 · Jun 9, 2024 · Updated last year
- ☆18 · Aug 15, 2023 · Updated 2 years ago
- EvalGPT is a code interpreter framework that utilizes large language models to automate the process of code-writing and execution, deliv… ☆248 · Sep 17, 2023 · Updated 2 years ago
- Vision utilities for web interaction agents 👀 ☆1,756 · Nov 25, 2024 · Updated last year
- 👀🧠 GPT-4 Vision x 💪⌨️ Vimium = Autonomous Web Agent ☆168 · Nov 16, 2023 · Updated 2 years ago
- ☆13 · Apr 8, 2023 · Updated 2 years ago
- Chrome extension to watch YouTube videos ad-free ☆17 · Jun 7, 2024 · Updated last year
- Browse the web with GPT-4V and Vimium ☆2,663 · Sep 25, 2024 · Updated last year
- ☆232 · Mar 7, 2024 · Updated 2 years ago
- Voice + Vision powered AI assistant that answers questions about any application, in context and in audio. ☆1,158 · Dec 21, 2023 · Updated 2 years ago
- Foundational Models for State-of-the-Art Speech and Text Translation ☆11 · Sep 13, 2023 · Updated 2 years ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w… ☆961 · Nov 5, 2025 · Updated 4 months ago
- Large Action Model framework to develop AI Web Agents ☆6,318 · Jan 21, 2025 · Updated last year
- AIlice is a fully autonomous, general-purpose AI agent. ☆1,394 · Aug 18, 2025 · Updated 7 months ago
- A web-app to explore topics using LLM (less typing and more clicks) ☆68 · Updated this week
- Google AI SDK for JavaScript ☆15 · Mar 13, 2025 · Updated last year
- 🧬 The open source chat-ai toolkit ☆916 · Aug 28, 2023 · Updated 2 years ago
- Pip Package for MirageML ☆25 · Nov 7, 2023 · Updated 2 years ago
- GPT-4 Vision Chrome Extension ☆108 · Nov 12, 2023 · Updated 2 years ago
- A starter for Langchain, Docker Compose, Fastapi, Qdrant, Sveltekit ☆24 · May 4, 2023 · Updated 2 years ago
- A versatile workflow automation platform to create, organize, and execute AI workflows, from a single LLM to complex AI-driven workflows. ☆537 · Dec 20, 2025 · Updated 3 months ago
- A framework to enable multimodal models to operate a computer. ☆10,189 · Sep 19, 2025 · Updated 6 months ago
- A plugin to enable AutoGPT to interact with websites. ☆263 · Nov 15, 2023 · Updated 2 years ago
- Example use cases for the GPT-4 Vision API ☆19 · Nov 26, 2023 · Updated 2 years ago
- A GPT-4 powered AI agent that can create full projects with iterative prompting ☆314 · Jan 31, 2024 · Updated 2 years ago
- Automate browser based workflows with AI ☆20,834 · Updated this week
- Personal AI search copilot, open-source Perplexity ☆783 · Aug 7, 2025 · Updated 7 months ago
- Videos Transcription and Translation with Faster Whisper and ChatGPT ☆242 · Apr 13, 2024 · Updated last year
- This is a tool that uses GPT4 Vision to operate your computer ☆30 · Dec 19, 2023 · Updated 2 years ago
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild ☆4,728 · Nov 18, 2024 · Updated last year
- iauto is a low-code engine for building and deploying AI agents ☆93 · Nov 22, 2024 · Updated last year