microsoft / OmniParser
A simple screen parsing tool towards pure vision based GUI agent
☆22,258 Updated 2 months ago
Alternatives and similar repositories for OmniParser
Users who are interested in OmniParser are comparing it to the libraries listed below.
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆44,538 Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆38,945 Updated this week
- A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language. ☆14,216 Updated this week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording ☆14,834 Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆61,918 Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆44,611 Updated this week
- Agno is a lightweight, high-performance library for building Agents. ☆27,218 Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆23,128 Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation ☆16,646 Updated this week
- 🖥️ Run AI Agent in your browser. ☆13,323 Updated last week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆24,399 Updated 3 weeks ago
- Roo Code (prev. Roo Cline) gives you a whole dev team of AI agents in your code editor. ☆14,715 Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training ☆12,482 Updated this week
- ☆6,194 Updated last week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ☆19,824 Updated 2 months ago
- Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥 ☆39,558 Updated this week
- No fortress, purely open ground. OpenManus is Coming. ☆46,082 Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆17,473 Updated last week
- Memory for AI Agents; SOTA in AI Agent Memory; Announcing OpenMemory MCP - local and secure memory management. ☆33,017 Updated this week
- Your AI Operator for Web, Android, Automation & Testing. ☆8,983 Updated this week
- The official Python SDK for Model Context Protocol servers and clients ☆13,211 Updated this week
- OCR & Document Extraction using vision models ☆11,232 Updated last week
- A collection of MCP servers. ☆51,932 Updated this week
- 🚀 The fast, Pythonic way to build MCP servers and clients ☆10,794 Updated this week
- Build Real-Time Knowledge Graphs for AI Agents ☆9,878 Updated this week
- A lightweight, powerful framework for multi-agent workflows ☆10,722 Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆25,439 Updated last week
- An open-source RAG-based tool for chatting with your documents. ☆22,347 Updated last month
- A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats. ☆34,203 Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ ☆28,257 Updated 2 months ago