OthersideAI / self-operating-computer
A framework to enable multimodal models to operate a computer.
☆9,291Updated 2 weeks ago
Alternatives and similar repositories for self-operating-computer:
Users that are interested in self-operating-computer are comparing it to the libraries listed below
- Automate browser-based workflows with LLMs and Computer Vision☆12,133Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆17,500Updated this week
- SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensiv…☆14,598Updated this week
- Crawl a site to generate knowledge files to create your own custom GPT from a URL☆20,844Updated 3 weeks ago
- A code-first agent framework for seamlessly planning and executing data analytics tasks.☆5,519Updated last week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆25,920Updated this week
- AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.☆5,481Updated 6 months ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆7,822Updated this week
- Large Action Model framework to develop AI Web Agents☆5,879Updated 3 weeks ago
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut…☆39,522Updated this week
- tiny vision language model☆7,392Updated last week
- Letta (formerly MemGPT) is a framework for creating LLM services with memory.☆14,495Updated this week
- ☆6,563Updated 2 weeks ago
- Private & local AI personal knowledge management app for high entropy people.☆7,630Updated 2 months ago
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆12,096Updated this week
- LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.☆18,470Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆19,878Updated this week
- Python scraper based on AI☆18,071Updated this week
- 🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation☆6,133Updated last week
- The #1 open-source voice interface for desktop, mobile, and ESP32 chips.☆5,029Updated 3 months ago
- OpenUI let's you describe UI using your imagination, then see it rendered live.☆19,918Updated 3 months ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆6,811Updated 2 weeks ago
- A natural language interface for computers☆58,350Updated 3 weeks ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆16,260Updated this week
- We write your reusable computer vision tools. 💜☆24,887Updated last week
- Chatbot for documentation, that allows you to chat with your data. Privately deployable, provides AI knowledge sharing and integrates kno…☆15,338Updated this week
- A collection of GPT system prompts and various prompt injection/leaking knowledge.☆8,558Updated this week
- Agno is a lightweight library for building multi-modal Agents☆18,930Updated this week
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.☆10,342Updated 3 weeks ago
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,838Updated 4 months ago