A simple screen parsing tool towards pure vision based GUI agent
β24,406Sep 12, 2025Updated 5 months ago
Alternatives and similar repositories for OmniParser
Users that are interested in OmniParser are comparing it to the libraries listed below
Sorting:
- π Make websites accessible for AI agents. Automate tasks online with ease.β79,028Updated this week
- A programming framework for agentic AIβ54,956Jan 22, 2026Updated last month
- Production-ready platform for agentic workflow development.β130,029Updated this week
- Python tool for converting files and office documents to Markdown.β87,527Feb 20, 2026Updated last week
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ60,971Updated this week
- Universal memory layer for AI Agentsβ47,994Updated this week
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infraβ28,399Updated this week
- π OpenHands: AI-Driven Developmentβ68,154Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. π¦₯ Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.β52,724Updated this week
- π₯ The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured dataβ84,899Feb 22, 2026Updated last week
- No fortress, purely open ground. OpenManus is Coming.β54,814Feb 11, 2026Updated 2 weeks ago
- πͺ Create rich visualizations with AIβ15,069Updated this week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creatβ¦β73,900Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pβ¦β58,263Updated this week
- The programming language for agentic software. Build, run, and manage multi-agent systems at scale.β38,104Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) systemβ31,031Feb 20, 2026Updated last week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phoneβ23,942Updated this week
- π The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programmingβ64,377Jan 21, 2026Updated last month
- Toolkit for linearizing PDFs for LLM datasets/trainingβ16,947Feb 19, 2026Updated last week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.β27,918Sep 30, 2025Updated 5 months ago
- Pioneering Automated GUI Interaction with Native Agentsβ9,712Jan 27, 2026Updated last month
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.β54,878Feb 21, 2026Updated last week
- Automate browser based workflows with AIβ20,530Updated this week
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.β54,870Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)β124,763Updated this week
- Driving all platforms UI automation with vision-based modelβ11,734Updated this week
- π¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationβ19,129Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.β21,026Mar 11, 2025Updated 11 months ago
- An open-source RAG-based tool for chatting with your documents.β25,152Jul 4, 2025Updated 7 months ago
- screenpipe turns your computer into a personal AI that knows everything you've done. record. search. automate. all local, all private, alβ¦β16,973Updated this week
- Get your documents ready for gen AIβ54,094Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing aβ¦β37,083Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMsβ71,234Updated this week
- The ultimate space for work and life β to find, build, and collaborate with agent teammates that grow with you. We are taking agent harneβ¦β72,564Updated this week
- SOTA Open Source TTSβ24,983Feb 2, 2026Updated 3 weeks ago
- A collection of MCP servers.β81,522Updated this week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.β163,632Updated this week
- Run frontier AI locally.β41,742Updated this week
- The AI Browser Automation Frameworkβ21,261Updated this week