ishan0102 / vimGPT
Browse the web with GPT-4V and Vimium
☆2,663 · Updated 4 months ago
Alternatives and similar repositories for vimGPT:
Users who are interested in vimGPT are comparing it to the repositories listed below:
- Build browser agents for real world tasks ☆1,000 · Updated last year
- Generate and auto-execute Python scripts in the cli ☆1,792 · Updated 9 months ago
- Voice + Vision powered AI assistant that answers questions about any application, in context and in audio. ☆1,142 · Updated last year
- A RAG LLM co-pilot for browsing the web, powered by local LLMs ☆1,477 · Updated 3 weeks ago
- Vision utilities for web interaction agents 👀 ☆1,591 · Updated 2 months ago
- ☆2,580 · Updated last month
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user… ☆1,286 · Updated last week
- Cross-Platform, GPU Accelerated Whisper 🏎️ ☆1,774 · Updated 11 months ago
- Agents Capable of Self-Editing Their Prompts / Python Code ☆754 · Updated 11 months ago
- FigmaChain is a set of Python scripts that generate HTML/CSS code based on Figma designs. Using OpenAI's GPT-3 model, FigmaChain enables … ☆968 · Updated last year
- AI powered one-click comprehensive docs from transcripts and text. ☆1,595 · Updated last week
- Generative fill in 3D. ☆739 · Updated 2 months ago
- ✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor. ☆2,257 · Updated 9 months ago
- Promptr is a CLI tool that applies plain language instructions to the filesystem. Instructions can utilize a liquidjs based templating sy… ☆919 · Updated 2 months ago
- Draw a ui and make it real ☆5,251 · Updated 3 weeks ago
- An AutoGPT agent that controls Chrome on your desktop ☆1,720 · Updated last year
- A browser AI agent, using GPT-4 ☆707 · Updated last year
- 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓 ☆3,215 · Updated this week
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist… ☆1,635 · Updated 9 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ☆1,580 · Updated 6 months ago
- Record voice notes & transcribe, summarize, and get tasks ☆1,840 · Updated last week
- ☆2,501 · Updated 10 months ago
- A series of top performing Text to SQL LLMs ☆870 · Updated last year
- Turn expensive prompts into cheap fine-tuned models ☆2,544 · Updated 8 months ago
- Create browser automation as if you were teaching a human using GPT-4 Vision. ☆577 · Updated last year
- Large Action Model framework to develop AI Web Agents ☆5,897 · Updated last month
- Automate your browser with GPT-4 ☆1,135 · Updated last month
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments ☆1,621 · Updated this week
- A GPT agent framework for invoking APIs ☆731 · Updated last year
- A school for camelids ☆1,210 · Updated last year