microsoft / OmniParser
A simple screen parsing tool towards pure vision based GUI agent
☆23,813 Updated 2 months ago
Alternatives and similar repositories for OmniParser
Users that are interested in OmniParser are comparing it to the libraries listed below
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra ☆19,455 Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆72,307 Updated this week
- Your AI Operator for Web, Android, Automation & Testing. ☆10,657 Updated this week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording ☆15,953 Updated 2 months ago
- ☆8,171 Updated last week
- Toolkit for linearizing PDFs for LLM datasets/training ☆15,909 Updated last week
- 🖥️ Run AI Agent in your browser. ☆15,182 Updated 2 months ago
- No fortress, purely open ground. OpenManus is Coming. ☆50,776 Updated last week
- Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management. ☆42,816 Updated last week
- ⚡ Easiest no code web data extraction platform • Instantly turn any website into API or spreadsheet ⚡ ☆13,838 Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. ☆12,323 Updated last month
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation ☆18,312 Updated last month
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more. ☆50,993 Updated last week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆55,537 Updated last week
- Fully local web research and report writing assistant ☆8,309 Updated 3 months ago
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data ☆67,598 Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM. ☆48,036 Updated last week
- MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone ☆22,200 Updated last month
- A lightweight, powerful framework for multi-agent workflows ☆17,245 Updated this week
- Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM! ☆8,599 Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. ☆7,127 Updated 4 months ago
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations ☆15,663 Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆115,188 Updated this week
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆16,124 Updated 2 weeks ago
- Python tool for converting files and office documents to Markdown. ☆82,911 Updated 3 weeks ago
- Use your locally running AI models to assist you in your web browsing ☆7,264 Updated this week
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execut… ☆17,945 Updated last week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. ☆25,335 Updated last month
- Integrate the DeepSeek API into popular softwares ☆34,378 Updated last month
- A collection of MCP servers. ☆74,900 Updated this week