The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
β28,815Mar 10, 2026Updated this week
Alternatives and similar repositories for UI-TARS-desktop
Users that are interested in UI-TARS-desktop are comparing it to the libraries listed below
Sorting:
- Pioneering Automated GUI Interaction with Native Agentsβ9,875Jan 27, 2026Updated last month
- π Make websites accessible for AI agents. Automate tasks online with ease.β80,443Updated this week
- No fortress, purely open ground. OpenManus is Coming.β55,205Feb 11, 2026Updated last month
- Kortix β build, manage and train AI Agents.β19,497Updated this week
- A simple screen parsing tool towards pure vision based GUI agentβ24,503Sep 12, 2025Updated 6 months ago
- AI-powered, vision-driven UI automation for every platform.β12,160Updated this week
- π¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationβ19,188Mar 4, 2026Updated last week
- Production-ready platform for agentic workflow development.β132,828Updated this week
- An open-source SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skills and subagents,β¦β29,488Updated this week
- Universal memory layer for AI Agentsβ49,365Updated this week
- π₯ The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured dataβ93,251Updated this week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creatβ¦β74,968Updated this week
- A collection of MCP servers.β82,625Updated this week
- π OpenHands: AI-Driven Developmentβ68,865Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pβ¦β58,987Updated this week
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ61,687Mar 9, 2026Updated last week
- Agent S: an open agentic framework that uses computers like a humanβ10,137Feb 21, 2026Updated 3 weeks ago
- A programming framework for agentic AIβ55,559Updated this week
- π The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programmingβ65,185Jan 21, 2026Updated last month
- Build, run, manage agentic software at scale.β38,700Updated this week
- The ultimate space for work and life β to find, build, and collaborate with agent teammates that grow with you. We are taking agent harneβ¦β73,318Mar 9, 2026Updated last week
- AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMsβ41,211Updated this week
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ inβ¦β178,915Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.β15,597Mar 4, 2026Updated last week
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processβ¦β27,339Updated this week
- Model Context Protocol Serversβ81,125Mar 7, 2026Updated last week
- π₯οΈ Run AI Agent in your browser.β15,687Aug 31, 2025Updated 6 months ago
- Build AI Agents, Visuallyβ50,762Updated this week
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Oβ¦β12,430Nov 24, 2025Updated 3 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)β126,337Mar 9, 2026Updated last week
- The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configration.β56,228Updated this week
- The AI Browser Automation Frameworkβ21,465Updated this week
- Python tool for converting files and office documents to Markdown.β90,728Updated this week
- π« CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.orgβ16,292Updated this week
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost β¦β25,499Mar 2, 2026Updated last week
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.β55,756Mar 7, 2026Updated last week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work tβ¦β45,821Updated this week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phoneβ24,094Mar 7, 2026Updated last week
- FlowGram is an extensible workflow development framework with built-in canvas, form, variable, and materials that helps developers build β¦β7,770Updated this week