microsoft / OmniParserLinks
A simple screen parsing tool towards pure vision based GUI agent
ā24,175Updated 4 months ago
Alternatives and similar repositories for OmniParser
Users that are interested in OmniParser are comparing it to the libraries listed below
Sorting:
- š Make websites accessible for AI agents. Automate tasks online with ease.ā75,163Updated this week
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infraā22,897Updated last week
- Pioneering Automated GUI Interaction with Native Agentsā8,885Updated this week
- š„ļø Run AI Agent in your browser.ā15,412Updated 4 months ago
- šš¤ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNā58,302Updated last week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recordingā16,370Updated last month
- Driving all platforms UI automation with vision-based modelā11,137Updated last week
- Toolkit for linearizing PDFs for LLM datasets/trainingā16,710Updated this week
- š„ The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured dataā73,460Updated this week
- š¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationā18,704Updated this week
- Agent S: an open agentic framework that uses computers like a humanā9,337Updated 3 weeks ago
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.ā7,311Updated last month
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ā9,657Updated 8 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.ā27,774Updated 3 months ago
- Python scraper based on AIā22,184Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.ā12,863Updated 3 months ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.ā20,772Updated 10 months ago
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.ā52,990Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) systemā30,175Updated this week
- Kortix ā build, manage and train AI Agents.ā19,070Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languagesā19,089Updated 2 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.ā18,844Updated this week
- Production-ready platform for agentic workflow development.ā124,907Updated last week
- MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phoneā22,594Updated 3 months ago
- Turn websites into clean data pipelines & structured APIs in minutes!ā14,133Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pā¦ā56,720Updated this week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.ā26,044Updated 3 months ago
- An open-source RAG-based tool for chatting with your documents.ā24,833Updated 6 months ago
- Automate browser based workflows with AIā20,054Updated this week
- No fortress, purely open ground. OpenManus is Coming.ā52,731Updated last week