microsoft / OmniParserLinks
A simple screen parsing tool towards pure vision based GUI agent
β23,813Updated last month
Alternatives and similar repositories for OmniParser
Users that are interested in OmniParser are comparing it to the libraries listed below
Sorting:
- π Make websites accessible for AI agents. Automate tasks online with ease.β72,307Updated this week
- β8,114Updated last week
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infraβ19,392Updated last week
- Your AI Operator for Web, Android, Automation & Testing.β10,606Updated this week
- π¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationβ18,274Updated last month
- π₯οΈ Run AI Agent in your browser.β15,128Updated 2 months ago
- No fortress, purely open ground. OpenManus is Coming.β50,640Updated last week
- Vision agentβ5,093Updated 2 months ago
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ55,537Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.β7,111Updated 4 months ago
- Toolkit for linearizing PDFs for LLM datasets/trainingβ15,772Updated last week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.β12,220Updated last month
- Fine-tuning & Reinforcement Learning for LLMs. π¦₯ Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.β48,036Updated this week
- Agent S: an open agentic framework that uses computers like a humanβ7,981Updated last week
- πͺ Create rich visualizations with AIβ14,081Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.β20,592Updated 8 months ago
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural lβ¦β3,741Updated 5 months ago
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.β5,351Updated last month
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagβ¦β30,866Updated this week
- Multi-agent framework, runtime and control plane. Built for speed, privacy, and scale.β34,876Updated this week
- A research prototype of a human-centered web agentβ7,903Updated last week
- Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.β42,816Updated this week
- Production-ready platform for agentic workflow development.β118,085Updated this week
- Integrate the DeepSeek API into popular softwaresβ34,378Updated last month
- An open protocol enabling communication and interoperability between opaque agentic applications.β20,566Updated this week
- An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and largeβ¦β18,018Updated 2 months ago
- Automate browser based workflows with AIβ17,115Updated this week
- Convert PDF to markdown + JSON quickly with high accuracyβ29,658Updated last week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.β27,579Updated last month
- A modular graph-based Retrieval-Augmented Generation (RAG) systemβ29,014Updated this week