bytedance / UI-TARS-desktop
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
14,662 stars. Updated this week.
Alternatives and similar repositories for UI-TARS-desktop
Users who are interested in UI-TARS-desktop are comparing it to the repositories listed below.
- (6,453 stars, updated last month)
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation (17,060 stars, updated last week)
- No fortress, purely open ground. OpenManus is Coming. (46,905 stars, updated this week)
- Your AI Operator for Web, Android, Automation & Testing. (9,240 stars, updated this week)
- Suna - Open Source Generalist AI Agent (14,460 stars, updated this week)
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O… (6,835 stars, updated this week)
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. (9,608 stars, updated this week)
- 🖥️ Run AI Agent in your browser. (13,754 stars, updated 2 weeks ago)
- Like Manus, Computer Use Agent (CUA) and Omniparser, we are computer-using agents. AI-driven local automation assistant that uses natural l… (3,486 stars, updated last month)
- A simple screen parsing tool towards pure vision based GUI agent (22,426 stars, updated 2 months ago)
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. (6,274 stars, updated 3 weeks ago)
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execut… (13,606 stars, updated this week)
- A research prototype of a human-centered web agent (5,276 stars, updated this week)
- A lightweight, powerful framework for multi-agent workflows (11,537 stars, updated this week)
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models. (5,192 stars, updated last month)
- Build Real-Time Knowledge Graphs for AI Agents (11,432 stars, updated this week)
- A live stream development of RL tuning for LLM agents (3,022 stars, updated 3 weeks ago)
- The fast, Pythonic way to build MCP servers and clients (12,692 stars, updated this week)
- Make websites accessible for AI agents. Automate tasks online with ease. (63,261 stars, updated this week)
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN (45,591 stars, updated last week)
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org (12,919 stars, updated this week)
- 🐬 DeepChat - A smart assistant that connects powerful AI to your personal world (3,484 stars, updated this week)
- Agent S: an open agentic framework that uses computers like a human (5,486 stars, updated this week)
- Use your locally running AI models to assist you in your web browsing (6,705 stars, updated this week)
- FlowGram is a node-based flow building engine that helps developers quickly create workflows in either fixed layout or free connection la… (5,342 stars, updated this week)
- Fully local web research and report writing assistant (7,601 stars, updated 2 months ago)
- A prompt optimizer that helps you write high-quality prompts (6,544 stars, updated this week)
- Get started with building Fullstack Agents using Gemini 2.5 and LangGraph (13,266 stars, updated last week)
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. (40,227 stars, updated this week)
- Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management. (34,513 stars, updated this week)