bytedance / UI-TARS-desktopLinks
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
β18,602Updated this week
Alternatives and similar repositories for UI-TARS-desktop
Users that are interested in UI-TARS-desktop are comparing it to the libraries listed below
Sorting:
- β7,614Updated last week
- π¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationβ18,028Updated last week
- A simple screen parsing tool towards pure vision based GUI agentβ23,475Updated 3 weeks ago
- Your AI Operator for Web, Android, Automation & Testing.β10,276Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.β11,403Updated last month
- No fortress, purely open ground. OpenManus is Coming.β49,585Updated this week
- Kortix β build, manage and train AI Agents. Fully Open Source.β17,932Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.β6,922Updated 2 months ago
- Agent S: an open agentic framework that uses computers like a humanβ6,244Updated 3 weeks ago
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural lβ¦β3,683Updated 4 months ago
- π₯οΈ Run AI Agent in your browser.β14,846Updated last week
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python executβ¦β16,858Updated this week
- π Make websites accessible for AI agents. Automate tasks online with ease.β69,439Updated this week
- Train your AI self, amplify you, bridge the worldβ14,166Updated 3 weeks ago
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost β¦β21,802Updated 2 months ago
- Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.β13,207Updated last month
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.β5,312Updated 3 months ago
- A live stream development of RL tunning for LLM agentsβ3,420Updated this week
- An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Cβ¦β16,553Updated this week
- A research prototype of a human-centered web agentβ7,637Updated this week
- The Open-Source Agentic Workspace for Human-AI Collaboration.β4,650Updated this week
- π The fast, Pythonic way to build MCP servers and clientsβ17,468Updated this week
- An open protocol enabling communication and interoperability between opaque agentic applications.β19,667Updated this week
- The ultimate LLM/AI application development framework in Golang.β7,179Updated this week
- FlowGram is a node-based flow building engine that helps developers quickly create workflows in either fixed layout or free connection laβ¦β6,699Updated this week
- π¬DeepChat - A smart assistant that connects powerful AI to your personal worldβ4,000Updated this week
- π WebAgent for Information Seeking built by Tongyi Lab: WebWalker & WebDancer & WebSailor & WebShaper & WebWatcher https://arxiv.org/absβ¦β6,466Updated last week
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Oβ¦β8,961Updated last week
- δΈζ¬Ύζη€Ίθ―δΌεε¨οΌε©εδΊηΌει«θ΄¨ιηζη€Ίθ―β14,553Updated 3 weeks ago
- Fine-tuning & Reinforcement Learning for LLMs. π¦₯ Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% lessβ¦β45,177Updated last week