microsoft / OmniParser
A simple screen parsing tool towards pure vision based GUI agent
☆20,951 Updated this week
Alternatives and similar repositories for OmniParser:
Users that are interested in OmniParser are comparing it to the libraries listed below
- Make websites accessible for AI agents ☆47,091 Updated this week
- Run AI Agent in your browser. ☆9,853 Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆31,990 Updated this week
- A collection of MCP servers. ☆13,804 Updated this week
- Model Context Protocol Servers ☆22,214 Updated this week
- Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥 ☆35,466 Updated this week
- Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI ☆20,730 Updated this week
- 🪄 Create rich visualizations with AI ☆10,640 Updated this week
- Use your locally running AI models to assist you in your web browsing ☆5,954 Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆23,734 Updated this week
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone ☆19,004 Updated 2 weeks ago
- Let AI be your browser operator. ☆7,074 Updated this week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording ☆12,698 Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆35,836 Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/mEkkMXFG ☆33,905 Updated this week
- Roo Code (prev. Roo Cline) gives you a whole dev team of AI agents in your code editor. ☆8,689 Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ ☆26,988 Updated last week
- Convert PDF to markdown + JSON quickly with high accuracy ☆23,161 Updated this week
- 🚀🚀 Train a 26M-parameter GPT from scratch in just 2h! ☆16,670 Updated last month
- Get your documents ready for gen AI ☆24,956 Updated this week
- Integrate the DeepSeek API into popular softwares ☆29,794 Updated this week
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud. ☆16,285 Updated last week
- ⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other… ☆24,790 Updated this week
- Janus-Series: Unified Multimodal Understanding and Generation Models ☆16,816 Updated last month
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more. ☆41,451 Updated this week