microsoft / OmniParserLinks
A simple screen parsing tool towards pure vision based GUI agent
☆22,426Updated 2 months ago
Alternatives and similar repositories for OmniParser
Users that are interested in OmniParser are comparing it to the libraries listed below
Sorting:
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆40,227Updated this week
- Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.☆34,513Updated this week
- No fortress, purely open ground. OpenManus is Coming.☆46,905Updated this week
- A collection of MCP servers.☆55,987Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆17,060Updated last week
- A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.☆14,662Updated this week
- Roo Code (prev. Roo Cline) gives you a whole dev team of AI agents in your code editor.☆15,579Updated this week
- 🖥️ Run AI Agent in your browser.☆13,754Updated 2 weeks ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆63,261Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆19,933Updated 3 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.☆40,545Updated last week
- Build AI Agents, Visually☆40,309Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆24,159Updated this week
- Model Context Protocol Servers☆54,658Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆6,274Updated 3 weeks ago
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆45,908Updated this week
- Your AI Operator for Web, Android, Automation & Testing.☆9,240Updated this week
- ☆6,453Updated last month
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,641Updated last week
- Full-stack framework for building Multi-Agent Systems with memory, knowledge and reasoning.☆28,467Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,862Updated last month
- Toolkit for linearizing PDFs for LLM datasets/training☆12,940Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut…☆46,091Updated this week
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming☆56,520Updated last week
- 🪄 Create rich visualizations with AI☆12,419Updated last week
- Fully local web research and report writing assistant☆7,601Updated 2 months ago
- LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.☆21,953Updated this week
- 利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.☆36,821Updated last week
- Python scraper based on AI☆20,007Updated this week
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆35,508Updated this week