microsoft / OmniParserLinks
A simple screen parsing tool towards pure vision based GUI agent
ā24,175Updated 3 months ago
Alternatives and similar repositories for OmniParser
Users that are interested in OmniParser are comparing it to the libraries listed below
Sorting:
- š Make websites accessible for AI agents. Automate tasks online with ease.ā75,163Updated this week
- No fortress, purely open ground. OpenManus is Coming.ā52,178Updated last week
- Pioneering Automated GUI Interaction with Native Agentsā8,758Updated 2 weeks ago
- š„ļø Run AI Agent in your browser.ā15,412Updated 4 months ago
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infraā20,394Updated last week
- š¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationā18,542Updated last week
- ā28,011Updated 5 months ago
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pā¦ā56,720Updated this week
- OCR & Document Extraction using vision modelsā12,015Updated 7 months ago
- šš¤ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNā58,302Updated last week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.ā20,772Updated 10 months ago
- Python tool for converting files and office documents to Markdown.ā84,994Updated last month
- Driving all platforms UI automation with vision-based modelā11,137Updated this week
- Toolkit for linearizing PDFs for LLM datasets/trainingā16,582Updated last week
- Agent S: an open agentic framework that uses computers like a humanā9,337Updated 3 weeks ago
- š¦ Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your cā¦ā21,001Updated last week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recordingā16,370Updated last month
- Automate browser based workflows with AIā20,054Updated this week
- Roo Code gives you a whole dev team of AI agents in your code editor.ā21,668Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.ā7,288Updated last month
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.ā12,863Updated 3 months ago
- šŖ Create rich visualizations with AIā14,682Updated this week
- Model Context Protocol Serversā75,662Updated this week
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Oā¦ā11,876Updated last month
- An open-source RAG-based tool for chatting with your documents.ā24,833Updated 6 months ago
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.ā5,367Updated 3 months ago
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural lā¦ā3,798Updated 7 months ago
- An open protocol enabling communication and interoperability between opaque agentic applications.ā21,338Updated this week
- Lightweight coding agent that runs in your terminalā55,534Updated this week
- A lightweight, powerful framework for multi-agent workflowsā18,157Updated last week