bytedance / UI-TARS-desktopLinks
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
โ27,325Updated 3 weeks ago
Alternatives and similar repositories for UI-TARS-desktop
Users that are interested in UI-TARS-desktop are comparing it to the libraries listed below
Sorting:
- Pioneering Automated GUI Interaction with Native Agentsโ9,343Updated 2 weeks ago
- ๐ฆ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationโ19,034Updated last week
- A simple screen parsing tool towards pure vision based GUI agentโ24,344Updated 5 months ago
- ๐ฅ๏ธ Run AI Agent in your browser.โ15,588Updated 5 months ago
- Driving all platforms UI automation with vision-based modelโ11,647Updated this week
- ๐ Make websites accessible for AI agents. Automate tasks online with ease.โ77,901Updated last week
- Kortix โ build, manage and train AI Agents.โ19,325Updated this week
- No fortress, purely open ground. OpenManus is Coming.โ54,333Updated last month
- Agent S: an open agentic framework that uses computers like a humanโ9,713Updated 3 weeks ago
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Oโฆโ12,162Updated 2 months ago
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.โ5,370Updated 4 months ago
- The first open-source agent skills builder. Define skills by vibe workflow, run on Claude Code, Cursor, Codex & more. Build Clawdbot ๐ฆยท โฆโ6,486Updated last week
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python executโฆโ19,658Updated last week
- LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.โ12,913Updated last week
- FlowGram is an extensible workflow development framework with built-in canvas, form, variable, and materials that helps developers build โฆโ7,674Updated 3 weeks ago
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost โฆโ24,939Updated 2 months ago
- A collection of MCP servers.โ80,690Updated last week
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural lโฆโ3,828Updated 8 months ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.โ13,234Updated last week
- Train your AI self, amplify you, bridge the worldโ15,087Updated 4 months ago
- Agent2Agent (A2A) is an open protocol enabling communication and interoperability between opaque agentic applications.โ21,844Updated this week
- An Open Source implementation of Notebook LM with more flexibility and featuresโ19,410Updated this week
- ๐ The fast, Pythonic way to build MCP servers and clientsโ22,675Updated this week
- ๐๐ค Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNโ59,947Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing aโฆโ35,429Updated this week
- A research prototype of a human-centered web agentโ9,632Updated 3 weeks ago
- Playwright MCP serverโ26,849Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pโฆโ57,756Updated this week
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"โ8,540Updated 3 months ago
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.aiโ4,856Updated 3 weeks ago