microsoft / OmniParser
A simple screen parsing tool towards pure vision based GUI agent
☆23,603 Updated 2 weeks ago
Alternatives and similar repositories for OmniParser
Users interested in OmniParser are comparing it to the libraries listed below.
- ☆7,827 Updated this week
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra ☆19,041 Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆70,580 Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation ☆18,158 Updated last week
- Your AI Operator for Web, Android, Automation & Testing. ☆10,356 Updated last week
- Toolkit for linearizing PDFs for LLM datasets/training ☆14,167 Updated this week
- Kortix – build, manage and train AI Agents. Fully Open Source. ☆18,191 Updated this week
- 🖥️ Run AI Agent in your browser. ☆14,923 Updated last month
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. ☆11,663 Updated 2 months ago
- The AI Browser Automation Framework ☆17,236 Updated this week
- Train your AI self, amplify you, bridge the world ☆14,331 Updated last week
- 🪄 Create rich visualizations with AI ☆13,709 Updated this week
- Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management. ☆40,544 Updated this week
- 🚀 The fast, Pythonic way to build MCP servers and clients ☆18,373 Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. ☆7,007 Updated 2 months ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ☆20,461 Updated 6 months ago
- No fortress, purely open ground. OpenManus is Coming. ☆50,076 Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆50,771 Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework ☆3,605 Updated last month
- A research prototype of a human-centered web agent ☆7,724 Updated this week
- Automate browser-based workflows with LLMs and Computer Vision ☆14,476 Updated this week
- The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data 🔥 ☆59,493 Updated this week
- Agent S: an open agentic framework that uses computers like a human ☆6,284 Updated last month
- OCR & Document Extraction using vision models ☆11,851 Updated 4 months ago
- Python tool for converting files and office documents to Markdown. ☆80,293 Updated 3 weeks ago
- Build Real-Time Knowledge Graphs for AI Agents ☆18,431 Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy ☆28,911 Updated this week
- The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets. ☆4,215 Updated this week
- An open-source RAG-based tool for chatting with your documents. ☆24,355 Updated 2 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆29,382 Updated this week