microsoft / OmniParserLinks
A simple screen parsing tool towards pure vision based GUI agent
☆24,344Updated 4 months ago
Alternatives and similar repositories for OmniParser
Users that are interested in OmniParser are comparing it to the libraries listed below
Sorting:
- Toolkit for linearizing PDFs for LLM datasets/training☆16,860Updated this week
- OCR & Document Extraction using vision models☆12,070Updated 8 months ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆77,901Updated this week
- Pioneering Automated GUI Interaction with Native Agents☆9,343Updated last week
- 🖥️ Run AI Agent in your browser.☆15,562Updated 5 months ago
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆19,034Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆59,492Updated this week
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra☆27,325Updated 3 weeks ago
- Driving all platforms UI automation with vision-based model☆11,532Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆20,914Updated 10 months ago
- Kortix – build, manage and train AI Agents.☆19,325Updated this week
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O…☆12,162Updated 2 months ago
- 🚀 The fast, Pythonic way to build MCP servers and clients☆22,675Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆13,162Updated this week
- 🪄 Create rich visualizations with AI☆14,801Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,794Updated last month
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural l…☆3,824Updated 8 months ago
- Automate browser based workflows with AI☆20,305Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆30,705Updated this week
- No fortress, purely open ground. OpenManus is Coming.☆54,333Updated last month
- ✨ Turn websites into structured APIs & clean data pipelines in minutes ✨☆14,208Updated this week
- A programming framework for agentic AI☆54,376Updated 2 weeks ago
- Build Real-Time Knowledge Graphs for AI Agents☆22,520Updated this week
- Production-ready platform for agentic workflow development.☆128,415Updated this week
- screenpipe turns your computer into a personal AI that knows everything you've done. record. search. automate. all local, all private, al…☆16,679Updated this week
- Agent S: an open agentic framework that uses computers like a human☆9,671Updated 2 weeks ago
- Convert PDF to markdown + JSON quickly with high accuracy☆31,421Updated last week
- A collection of MCP servers.☆80,333Updated last week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,563Updated 2 months ago
- A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations☆16,458Updated this week