bytedance / UI-TARSLinks
Pioneering Automated GUI Interaction with Native Agents
☆9,060Updated last week
Alternatives and similar repositories for UI-TARS
Users that are interested in UI-TARS are comparing it to the libraries listed below
Sorting:
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra☆24,755Updated last week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆18,943Updated this week
- Agent S: an open agentic framework that uses computers like a human☆9,608Updated last week
- Driving all platforms UI automation with vision-based model☆11,420Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆24,265Updated 4 months ago
- A live stream development of RL tunning for LLM agents☆3,839Updated 3 months ago
- Kortix – build, manage and train AI Agents.☆19,227Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆13,053Updated this week
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)☆5,065Updated last month
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org☆15,727Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,517Updated 2 months ago
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O…☆12,050Updated 2 months ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆3,305Updated 6 months ago
- Fully local web research and report writing assistant☆8,477Updated 5 months ago
- A lightweight, powerful framework for multi-agent workflows☆18,524Updated this week
- No fortress, purely open ground. OpenManus is Coming.☆53,529Updated 3 weeks ago
- 🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.☆5,313Updated last week
- 🖥️ Run AI Agent in your browser.☆15,484Updated 4 months ago
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai☆4,845Updated last week
- Out-of-the-box (OOTB) GUI Agent for Windows and macOS☆1,868Updated 8 months ago
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural l…☆3,821Updated 8 months ago
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆8,480Updated 3 months ago
- ☆10,303Updated 5 months ago
- An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl☆6,155Updated 8 months ago
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execut…☆19,366Updated this week
- ⚙️ Create and run workflows (RPA 2.0)☆3,855Updated last week
- Playwright Model Context Protocol Server - Tool to automate Browsers and APIs in Claude Desktop, Cline, Cursor IDE and More 🔌☆5,173Updated last month
- Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.☆14,944Updated last month
- Task-Aware Agent-driven Prompt Optimization Framework☆3,742Updated 3 months ago
- An open protocol enabling communication and interoperability between opaque agentic applications.☆21,618Updated this week