addy999 / omniparser-apiLinks
Self-hosted version of Microsoft's OmniParser Image-to-text model
☆66Updated this week
Alternatives and similar repositories for omniparser-api
Users that are interested in omniparser-api are comparing it to the libraries listed below
Sorting:
- AI web agent to find answers to any question☆33Updated this week
- List of Open Source projects built on Browser Use☆71Updated last month
- AI conflict resolution framework designed to work alongside existing AI orchestration tools☆24Updated 5 months ago
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis☆99Updated 6 months ago
- Supercompat allows you to use any AI provider like Anthropic, Groq or Mistral with OpenAI-compatible Assistants API.☆81Updated 2 weeks ago
- Pocket Flow Tutorial Project: AI Paul Graham, just in case you don't get in...☆48Updated 2 months ago
- Model Context Protocol Servers (Browserbase Version)☆48Updated 6 months ago
- OmniMCP uses Microsoft OmniParser and Model Context Protocol (MCP) to provide AI models with rich UI context and powerful interaction cap…☆43Updated last month
- One connection for all your MCP servers.☆40Updated last month
- The easiest way to get structured data from unstructured text or images using LLMs. No prompt engineering, no chat history, just a simple…☆53Updated last month
- FastAPI server implementing MCP protocol Browser automation via browser-use library.☆42Updated this week
- ☆40Updated last month
- A MCP server connecting to managed indexes on LlamaCloud☆76Updated 3 weeks ago
- Record voice notes & transcribe, summarize, and get tasks☆42Updated last year
- Turn any input document into a sophisticated, context-dependent mindmap that distills the meaning and structure of the document.☆54Updated 3 months ago
- iauto is a low-code engine for building and deploying AI agents☆87Updated 6 months ago
- Redact PDF/image-based documents, or CSV/XLSX files using a Gradio-based GUI interface☆20Updated this week
- Phi4 Multimodal Instruct - OpenAI endpoint and Docker Image for self-hosting☆37Updated 3 months ago
- Embed anything.☆28Updated last year
- ☆30Updated 4 months ago
- ☆29Updated 3 months ago
- Chrome extension that interacts with content using Groq☆41Updated 4 months ago
- A lightweight tool to optimize your Javascript / Typescript project for LLM context windows by using a knowledge graph | AI code understa…☆53Updated 6 months ago
- Adaptive Modular Network (AMN) a potentially novel machine learning architecture capable of producing models which can learn at inference…☆52Updated 2 months ago
- Used to take chat histories from Windsurf and convert it to a condensed file that can be used on new sessions☆20Updated 4 months ago
- 👷♂️Minion is Agent's Brain. Minion is designed to execute any type of queries, offering a variety of features that demonstrate its flex…☆17Updated this week
- Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of Playwright-based spiders with minimal man…☆67Updated 2 weeks ago
- Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.☆22Updated 6 months ago
- Build reliable, secure, and production-ready AI apps easily.☆72Updated last week
- A memory framework for Large Language Models and Agents.☆179Updated 5 months ago