microsoft / OmniParser
A simple screen parsing tool towards pure vision based GUI agent
☆24,344 · Updated 4 months ago
Alternatives and similar repositories for OmniParser
Users that are interested in OmniParser are comparing it to the libraries listed below
- Make websites accessible for AI agents. Automate tasks online with ease. ☆77,901 · Updated this week
- No fortress, purely open ground. OpenManus is Coming. ☆54,001 · Updated last month
- 🖥️ Run AI Agent in your browser. ☆15,562 · Updated 5 months ago
- OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation ☆19,004 · Updated last week
- Pioneering Automated GUI Interaction with Native Agents ☆9,134 · Updated last week
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra ☆25,104 · Updated 3 weeks ago
- Create rich visualizations with AI ☆14,801 · Updated this week
- Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆59,492 · Updated this week
- Driving all platforms UI automation with vision-based model ☆11,532 · Updated this week
- Agent S: an open agentic framework that uses computers like a human ☆9,671 · Updated 2 weeks ago
- Toolkit for linearizing PDFs for LLM datasets/training ☆16,833 · Updated last week
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models. ☆5,370 · Updated 4 months ago
- An AI Hedge Fund Team ☆45,552 · Updated 2 months ago
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. ☆7,551 · Updated 2 months ago
- Fully local web research and report writing assistant ☆8,494 · Updated 6 months ago
- Integrate the DeepSeek API into popular softwares ☆35,349 · Updated 4 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆27,861 · Updated 4 months ago
- Kortix – build, manage and train AI Agents. ☆19,325 · Updated this week
- An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large… ☆18,419 · Updated 5 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,228 · Updated this week
- Like Manus, Computer Use Agent (CUA) and Omniparser, we are computer-using agents. AI-driven local automation assistant that uses natural l… ☆3,824 · Updated 8 months ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows. ☆53,776 · Updated this week
- screenpipe turns your computer into a personal AI that knows everything you've done. record. search. automate. all local, all private, al… ☆16,679 · Updated this week
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost … ☆24,874 · Updated 2 months ago
- Automate browser based workflows with AI ☆20,305 · Updated this week
- OCR & Document Extraction using vision models ☆12,070 · Updated 8 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. ☆51,625 · Updated this week
- The AI Browser Automation Framework ☆20,833 · Updated this week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone ☆23,054 · Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks ☆6,794 · Updated last month