addy999 / omniparser-api
Self-hosted version of Microsoft's OmniParser Image-to-text model
★69 Updated last month
Alternatives and similar repositories for omniparser-api
Users who are interested in omniparser-api are comparing it to the libraries listed below.
- LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis ★100 Updated 7 months ago
- AI web agent to find answers to any question ★33 Updated last month
- OmniMCP uses Microsoft OmniParser and Model Context Protocol (MCP) to provide AI models with rich UI context and powerful interaction cap… ★50 Updated 3 months ago
- Turning small task descriptions into mega prompts automatically. ★83 Updated 11 months ago
- iauto is a low-code engine for building and deploying AI agents ★88 Updated 7 months ago
- List of Open Source projects built on Browser Use ★88 Updated 2 months ago
- Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of Playwright-based spiders with minimal man… ★77 Updated 3 weeks ago
- Build Phone Calling Voice Agent fully powered by open source models. ★49 Updated 2 months ago
- Turn any input document into a sophisticated, context-dependent mindmap that distills the meaning and structure of the document. ★66 Updated 4 months ago
- Mixture of Agents Model for use with Claude Sonnet 3.5, Gemini 1.5 Pro & ChatGPT-4o ★37 Updated last year
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no… ★125 Updated 8 months ago
- an auto coder which automatically fixes errors and improves the code from simple user prompt ★37 Updated 6 months ago
- A JS web client for connecting to Pipecat bots with voice and vision ★45 Updated 6 months ago
- Augment AI agents with long-term memory through knowledge graph ★74 Updated 9 months ago
- FastAPI server implementing MCP protocol Browser automation via browser-use library. ★50 Updated this week
- A collection of cookbooks to help developers get started quickly with the Firecrawl API. ★47 Updated 5 months ago
- Pocket Flow Tutorial Project: AI Paul Graham, just in case you don't get in... ★50 Updated 4 months ago
- CodeWhisper: AI-Powered End-to-End Task Implementation & blazingly fast Codebase-to-LLM Context Bridge ★87 Updated 7 months ago
- MarinaBox is a toolkit for creating and managing secure, isolated environments for AI agents ★133 Updated 4 months ago
- Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch… ★80 Updated 10 months ago
- Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extr… ★204 Updated last week
- A lightweight tool to optimize your Javascript / Typescript project for LLM context windows by using a knowledge graph | AI code understa… ★56 Updated 7 months ago
- Web Agent is an automation tool driven by AI. Designed for seamless navigation and task execution on the web, it intelligently interacts … ★88 Updated last week
- Build reliable, secure, and production-ready AI apps easily. ★74 Updated this week
- Supercompat allows you to use any AI provider like Anthropic, Groq or Mistral with OpenAI-compatible Assistants API. ★87 Updated 2 weeks ago
- ★82 Updated 5 months ago
- LangGraph-GUI backend with fastapi ★58 Updated last month
- A simple Python program to implement the search-extract-summarize flow. ★269 Updated last month
- YouTube Script Writer is an open-source AI agent that generates tailored video scripts based on title, language, tone, and length. It str… ★12 Updated 4 months ago
- Local first human friendly agents toolkit for the browser and Nodejs ★42 Updated last month