bytedance / UI-TARS-desktopLinks
The Open-sourced Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.
β15,374Updated this week
Alternatives and similar repositories for UI-TARS-desktop
Users that are interested in UI-TARS-desktop are comparing it to the libraries listed below
Sorting:
- β6,767Updated 2 months ago
- π¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationβ17,646Updated last week
- Suna - Open Source Generalist AI Agentβ16,998Updated this week
- A simple screen parsing tool towards pure vision based GUI agentβ22,933Updated 4 months ago
- No fortress, purely open ground. OpenManus is Coming.β48,422Updated last week
- Your AI Operator for Web, Android, Automation & Testing.β9,757Updated this week
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Oβ¦β8,039Updated this week
- π₯οΈ Run AI Agent in your browser.β14,404Updated 2 months ago
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural lβ¦β3,627Updated 2 months ago
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python executβ¦β15,694Updated last week
- π Make websites accessible for AI agents. Automate tasks online with ease.β66,313Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.β6,681Updated 3 weeks ago
- Agent S: an open agentic framework that uses computers like a humanβ5,905Updated 2 weeks ago
- A live stream development of RL tunning for LLM agentsβ3,249Updated 2 weeks ago
- Train your AI self, amplify you, bridge the worldβ13,630Updated this week
- FlowGram is a node-based flow building engine that helps developers quickly create workflows in either fixed layout or free connection laβ¦β6,183Updated this week
- Toolkit for linearizing PDFs for LLM datasets/trainingβ13,346Updated this week
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.β5,289Updated 2 months ago
- A visual playground for agentic workflows: Iterate over your agents 10x fasterβ5,315Updated last week
- A research prototype of a human-centered web agentβ6,627Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.β10,511Updated last week
- π₯ Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.β43,735Updated this week
- The world's first open-source "Vibe Workflow" platform for complex tasks.β4,395Updated this week
- An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Cβ¦β4,410Updated this week
- Model Context Protocol Serversβ62,052Updated this week
- Playwright MCP serverβ15,667Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pβ¦β48,375Updated this week
- π The fast, Pythonic way to build MCP servers and clientsβ15,295Updated this week
- π₯ Official Firecrawl MCP Server - Adds powerful web scraping to Cursor, Claude and any other LLM clients.β3,934Updated last week
- π¬DeepChat - A smart assistant that connects powerful AI to your personal worldβ3,737Updated this week