bytedance / UI-TARS-desktop
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.
β6,456Updated this week
Alternatives and similar repositories for UI-TARS-desktop:
Users that are interested in UI-TARS-desktop are comparing it to the libraries listed below
- β3,340Updated last month
- A simple screen parsing tool towards pure vision based GUI agentβ21,127Updated this week
- π¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationβ14,067Updated this week
- A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine langβ¦β4,580Updated this week
- Let AI be your browser operator.β7,371Updated this week
- Toolkit for linearizing PDFs for LLM datasets/trainingβ10,379Updated last week
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.β4,900Updated last month
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.β4,800Updated this week
- Run AI Agent in your browser.β10,134Updated this week
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Oβ¦β4,336Updated this week
- Fully local web research and report writing assistantβ6,669Updated this week
- Roo Code (prev. Roo Cline) gives you a whole dev team of AI agents in your code editor.β9,228Updated this week
- Use your locally running AI models to assist you in your web browsingβ6,029Updated this week
- The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.β3,248Updated this week
- Make websites accessible for AI agentsβ49,816Updated this week
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural lβ¦β2,807Updated this week
- Magic to turn Cursor/Windsurf as 90% of Devinβ5,191Updated last week
- A lightweight, powerful framework for multi-agent workflowsβ7,079Updated this week
- A collection of MCP servers.β15,963Updated this week
- Model Context Protocol Serversβ25,092Updated this week
- πͺ Create rich visualizations with AIβ10,956Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x fasterβ3,910Updated this week
- No fortress, purely open ground. OpenManus is Coming.β39,444Updated this week
- π¨ Refly is an open-source AI-native creation engine. Its intuitive free-form canvas interface combines multi-threaded dialogues, artifacβ¦β3,030Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercelβ¦β7,454Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pβ¦β36,673Updated this week
- Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, mβ¦β85,661Updated this week
- A live stream development of RL tunning for LLM agentsβ1,883Updated this week
- π An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)β6,230Updated 2 months ago
- Train your AI self, amplify you, bridge the worldβ5,176Updated this week