microsoft / OmniParserLinks
A simple screen parsing tool towards pure vision based GUI agent
☆23,326Updated this week
Alternatives and similar repositories for OmniParser
Users that are interested in OmniParser are comparing it to the libraries listed below
Sorting:
- ☆6,949Updated 3 months ago
- 🖥️ Run AI Agent in your browser.☆14,598Updated last week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆68,120Updated this week
- The Open-sourced Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.☆16,504Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆13,781Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆17,853Updated 3 weeks ago
- No fortress, purely open ground. OpenManus is Coming.☆49,003Updated last week
- 🪄 Create rich visualizations with AI☆13,293Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆48,597Updated this week
- Your AI Operator for Web, Android, Automation & Testing.☆9,932Updated this week
- Official inference framework for 1-bit LLMs☆20,695Updated 2 months ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆10,976Updated 3 weeks ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆51,178Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆6,809Updated last month
- Vision agent☆5,007Updated 2 weeks ago
- Kortix – build, manage and train AI Agents. Fully Open Source.☆17,598Updated this week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆15,441Updated last week
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆42,085Updated this week
- Use your locally running AI models to assist you in your web browsing☆6,998Updated last week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆27,156Updated last month
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org☆13,895Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆27,307Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less…☆44,314Updated this week
- The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.☆4,044Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,358Updated last month
- Roo Code gives you a whole dev team of AI agents in your code editor.☆18,810Updated this week
- ☆4,190Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆62,444Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,785Updated 6 months ago
- Integrate the DeepSeek API into popular softwares☆33,423Updated 3 months ago