bytedance / UI-TARS-desktopLinks
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
β19,041Updated this week
Alternatives and similar repositories for UI-TARS-desktop
Users that are interested in UI-TARS-desktop are comparing it to the libraries listed below
Sorting:
- β7,827Updated this week
- π¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationβ18,158Updated last week
- Kortix β build, manage and train AI Agents. Fully Open Source.β18,191Updated this week
- No fortress, purely open ground. OpenManus is Coming.β50,076Updated this week
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python executβ¦β17,228Updated this week
- Your AI Operator for Web, Android, Automation & Testing.β10,402Updated this week
- π₯οΈ Run AI Agent in your browser.β14,957Updated last month
- Agent S: an open agentic framework that uses computers like a humanβ6,306Updated last month
- A simple screen parsing tool towards pure vision based GUI agentβ23,603Updated 3 weeks ago
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Oβ¦β9,820Updated last week
- Train your AI self, amplify you, bridge the worldβ14,331Updated 2 weeks ago
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.β5,325Updated 4 months ago
- π Make websites accessible for AI agents. Automate tasks online with ease.β70,580Updated this week
- Roo Code gives you a whole dev team of AI agents in your code editor.β19,922Updated this week
- A research prototype of a human-centered web agentβ7,724Updated this week
- A lightweight, powerful framework for multi-agent workflowsβ15,132Updated this week
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.aiβ4,639Updated this week
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost β¦β22,017Updated 2 weeks ago
- A live stream development of RL tunning for LLM agentsβ3,482Updated last week
- FlowGram is a node-based flow building engine that helps developers quickly create workflows in either fixed layout or free connection laβ¦β6,854Updated this week
- π The fast, Pythonic way to build MCP servers and clientsβ18,373Updated this week
- Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerizβ¦β8,143Updated 3 weeks ago
- An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Cβ¦β17,162Updated this week
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ53,894Updated last week
- The AI Browser Automation Frameworkβ17,236Updated last week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.β11,794Updated last week
- Trae Agent is an LLM-based agent for general purpose software engineering tasks.β9,592Updated last week
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"β7,396Updated last month
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.β7,007Updated 2 months ago
- Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI asβ¦β8,314Updated last month