microsoft / OmniParser
A simple screen parsing tool towards pure vision based GUI agent
★23,930 · Updated 2 months ago
Alternatives and similar repositories for OmniParser
Users who are interested in OmniParser are comparing it to the libraries listed below.
- Toolkit for linearizing PDFs for LLM datasets/training ★16,058 · Updated last week
- Run AI Agent in your browser. ★15,248 · Updated 3 months ago
- Multi-agent framework, runtime and control plane. Built for speed, privacy, and scale. ★35,542 · Updated this week
- OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation ★18,356 · Updated last week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ★20,654 · Updated 8 months ago
- The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data ★68,626 · Updated this week
- Make websites accessible for AI agents. Automate tasks online with ease. ★73,105 · Updated this week
- ★8,259 · Updated 2 weeks ago
- Universal memory layer for AI Agents ★43,677 · Updated this week
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra ★19,580 · Updated last week
- Your AI Operator for Web, Android, Automation & Testing. ★10,743 · Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ★52,683 · Updated this week
- OpenHands: Code Less, Make More ★65,284 · Updated this week
- Run your own AI cluster at home with everyday devices ★32,584 · Updated 3 weeks ago
- Easily build AI systems with Evals, RAG, Agents, fine-tuning, synthetic data, and more. ★4,420 · Updated last week
- Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ★56,514 · Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. ★7,172 · Updated last week
- A collection of MCP servers. ★75,787 · Updated this week
- Kortix - build, manage and train AI Agents. Fully Open Source. ★18,674 · Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. ★12,476 · Updated 2 months ago
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ★31,849 · Updated this week
- No fortress, purely open ground. OpenManus is Coming. ★50,996 · Updated 2 weeks ago
- Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time. ★19,291 · Updated this week
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents. AI-driven local automation assistant that uses natural l… ★3,769 · Updated 6 months ago
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording ★16,023 · Updated 3 months ago
- Agent S: an open agentic framework that uses computers like a human ★8,390 · Updated last month
- A lightweight, powerful framework for multi-agent workflows ★17,499 · Updated last week
- Fine-tuning & Reinforcement Learning for LLMs. Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM. ★48,786 · Updated this week
- OCR & Document Extraction using vision models ★11,968 · Updated 6 months ago
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat… ★68,321 · Updated last week