bytedance / UI-TARS-desktopLinks
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
☆20,078Updated last week
Alternatives and similar repositories for UI-TARS-desktop
Users that are interested in UI-TARS-desktop are comparing it to the libraries listed below
Sorting:
- ☆8,630Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆18,492Updated last month
- Kortix – build, manage and train AI Agents.☆18,844Updated this week
- Driving all platforms UI automation with vision-based model☆10,952Updated this week
- 🖥️ Run AI Agent in your browser.☆15,336Updated 3 months ago
- Agent S: an open agentic framework that uses computers like a human☆8,806Updated last week
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execut…☆18,748Updated this week
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O…☆11,657Updated last month
- A simple screen parsing tool towards pure vision based GUI agent☆24,041Updated 3 months ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆73,975Updated this week
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural l…☆3,781Updated 7 months ago
- A research prototype of a human-centered web agent☆9,297Updated last week
- No fortress, purely open ground. OpenManus is Coming.☆51,425Updated last month
- Playwright Model Context Protocol Server - Tool to automate Browsers and APIs in Claude Desktop, Cline, Cursor IDE and More 🔌☆5,079Updated last week
- FlowGram is an extensible workflow development framework with built-in canvas, form, variable, and materials that helps developers build …☆7,456Updated this week
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.☆5,363Updated 2 months ago
- 🚀 The fast, Pythonic way to build MCP servers and clients☆21,367Updated this week
- Playwright MCP server☆24,670Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆12,764Updated 2 months ago
- The AI Browser Automation Framework☆19,544Updated this week
- Trae Agent is an LLM-based agent for general purpose software engineering tasks.☆10,295Updated 3 months ago
- Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI as…☆9,626Updated this week
- Tongyi Deep Research, the Leading Open-source Deep Research Agent☆17,658Updated last week
- An open protocol enabling communication and interoperability between opaque agentic applications.☆21,075Updated last week
- A collection of MCP servers.☆76,888Updated last week
- Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.☆14,681Updated 3 weeks ago
- Vibe Workflow Platform for Non-technical Creators.☆5,741Updated this week
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost …☆24,129Updated last month
- A live stream development of RL tunning for LLM agents☆3,686Updated 2 months ago
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆25,863Updated 2 months ago