bytedance / UI-TARS-desktop
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
★13,137 · Updated last week
Alternatives and similar repositories for UI-TARS-desktop:
Users who are interested in UI-TARS-desktop are comparing it to the repositories listed below:
- ★5,635 · Updated last week
- OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation · ★16,131 · Updated this week
- Suna - Open Source Generalist AI Agent · ★9,536 · Updated this week
- Your AI Operator for Web, Android, Automation & Testing. · ★8,591 · Updated this week
- Run AI Agent in your browser. · ★12,794 · Updated this week
- No fortress, purely open ground. OpenManus is Coming. · ★45,114 · Updated last week
- Like Manus, Computer Use Agent (CUA) and Omniparser, we are computer-using agents. AI-driven local automation assistant that uses natural l… · ★3,279 · Updated last month
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O… · ★5,386 · Updated this week
- A simple screen parsing tool towards pure vision based GUI agent · ★21,888 · Updated last month
- Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. · ★37,616 · Updated this week
- The fast, Pythonic way to build MCP servers and clients · ★8,458 · Updated this week
- Make websites accessible for AI agents · ★58,844 · Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. · ★5,811 · Updated last week
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models. · ★5,092 · Updated 3 months ago
- Fully local web research and report writing assistant · ★7,237 · Updated last month
- An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl · ★5,480 · Updated 2 months ago
- Agent S: an open agentic framework that uses computers like a human · ★4,219 · Updated this week
- A lightweight, powerful framework for multi-agent workflows · ★9,840 · Updated this week
- Open Source No Code Web Data Extraction Platform. Turn Websites To APIs & Spreadsheets With No-Code Robots In Minutes · ★12,436 · Updated this week
- Agno is a lightweight library for building Agents with memory, knowledge, tools and reasoning. · ★26,158 · Updated this week
- CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org · ★12,244 · Updated this week
- A live stream development of RL tuning for LLM agents · ★2,617 · Updated this week
- Build Real-Time Knowledge Graphs for AI Agents · ★8,122 · Updated this week
- The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets. · ★3,437 · Updated this week
- Magic to turn Cursor/Windsurf as 90% of Devin · ★5,480 · Updated last week
- Train your AI self, amplify you, bridge the world · ★11,705 · Updated last week
- Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN · ★41,970 · Updated this week
- Monitor browser logs directly from Cursor and other MCP compatible IDEs. · ★3,945 · Updated last month
- Sharing some useful Dify DSL workflows, suitable for both personal use and learning. · ★6,785 · Updated last week
- Playwright MCP server · ★9,437 · Updated this week