addy999 / omniparser-api
Self-hosted version of Microsoft's OmniParser Image-to-text model
☆65Updated 5 months ago
Alternatives and similar repositories for omniparser-api
Users who are interested in omniparser-api are comparing it to the libraries listed below.
- AI web agent to find answers to any question☆32Updated 3 months ago
- iauto is a low-code engine for building and deploying AI agents☆86Updated 5 months ago
- List of Open Source projects built on Browser Use☆59Updated last week
- Turn any input document into a sophisticated, context-dependent mindmap that distills the meaning and structure of the document.☆45Updated 2 months ago
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis☆98Updated 5 months ago
- A memory framework for Large Language Models and Agents.☆178Updated 4 months ago
- Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of Playwright-based spiders with minimal man…☆66Updated last week
- LangGraph-GUI backend with fastapi☆53Updated 2 months ago
- A collection of cookbooks to help developers get started quickly with the Firecrawl API.☆45Updated 2 months ago
- Redact PDF/image-based documents, or CSV/XLSX files using a Gradio-based GUI interface☆17Updated this week
- MarinaBox is a toolkit for creating and managing secure, isolated environments for AI agents☆129Updated 2 months ago
- An experimental and alternative approach to Finetuning and RAG.☆35Updated last year
- Grapheteria: A structured framework bringing uniformity to agent orchestration!☆45Updated last week
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆120Updated 6 months ago
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆37Updated last year
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Modified to use local Ollama endpoint☆50Updated 6 months ago
- Flowchart-like UI to interconnect LLMs and Huggingface models, and deploy them as a REST API with little to no code.☆71Updated last month
- A RAG system designed to process documents with multimodal content. It can generate factual, context-aware answers to user queries, based…☆21Updated 4 months ago
- ☆37Updated last month
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆32Updated 10 months ago
- FastAPI server implementing the MCP protocol for browser automation via the browser-use library.☆40Updated last week
- A CLI tool for easy installation of MCP servers and managing their configuration☆55Updated last month
- smart-llm-loader is a lightweight yet powerful Python package that transforms any document into LLM-ready chunks. Spend less time on prep…☆65Updated 2 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆60Updated 9 months ago
- Chat with any website on your local machine☆74Updated 10 months ago
- A virtual employee that scours the web, organizes data, and delivers results in a spreadsheet☆76Updated last week
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆103Updated 4 months ago
- Mixture of Agents Model for use with Claude Sonnet 3.5, Gemini 1.5 Pro & ChatGPT-4o☆36Updated 10 months ago
- InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications …☆112Updated last year
- An MCP server connecting to managed indexes on LlamaCloud☆74Updated last week