microsoft / OmniParserLinks
A simple screen parsing tool towards pure vision based GUI agent
ā22,643Updated 3 months ago
Alternatives and similar repositories for OmniParser
Users that are interested in OmniParser are comparing it to the libraries listed below
Sorting:
- The Open-sourced Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.ā15,206Updated this week
- š¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationā17,507Updated this week
- ā6,687Updated last month
- š Make websites accessible for AI agents. Automate tasks online with ease.ā65,506Updated this week
- šŖ Create rich visualizations with AIā12,701Updated last week
- No fortress, purely open ground. OpenManus is Coming.ā48,014Updated this week
- š„ Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.ā43,098Updated this week
- Toolkit for linearizing PDFs for LLM datasets/trainingā13,234Updated this week
- The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.ā3,955Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.ā20,099Updated 4 months ago
- Official inference framework for 1-bit LLMsā20,532Updated last month
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pā¦ā47,757Updated this week
- Model Context Protocol Serversā60,223Updated this week
- š The fast, Pythonic way to build MCP servers and clientsā14,655Updated this week
- A lightweight, powerful framework for multi-agent workflowsā12,657Updated this week
- š„ļø Run AI Agent in your browser.ā14,192Updated last month
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.ā5,259Updated last month
- Build Real-Time Knowledge Graphs for AI Agentsā14,119Updated this week
- Suna - Open Source Generalist AI Agentā16,698Updated this week
- Automate browser-based workflows with LLMs and Computer Visionā13,816Updated this week
- Integrate the DeepSeek API into popular softwaresā33,179Updated 2 months ago
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.ā6,533Updated last week
- Run your own AI cluster at home with everyday devices š±š» š„ļøāā29,010Updated 3 months ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.ā10,089Updated last month
- Your AI Operator for Web, Android, Automation & Testing.ā9,648Updated this week
- A collection of MCP servers.ā62,323Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)ā103,120Updated this week
- Fully open reproduction of DeepSeek-R1ā25,056Updated last week
- šš¤ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNā47,851Updated last week
- Task-Aware Agent-driven Prompt Optimization Frameworkā3,404Updated last week