Pioneering Automated GUI Interaction with Native Agents
☆10,065 · Jan 27, 2026 · Updated 2 months ago
Alternatives and similar repositories for UI-TARS
Users that are interested in UI-TARS are comparing it to the libraries listed below.
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra ☆29,356 · Mar 27, 2026 · Updated 2 weeks ago
- AI-powered, vision-driven UI automation for every platform. ☆12,574 · Updated this week
- Agent S: an open agentic framework that uses computers like a human ☆10,812 · Feb 21, 2026 · Updated last month
- A simple screen parsing tool towards pure vision based GUI agent ☆24,619 · Sep 12, 2025 · Updated 6 months ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆86,467 · Updated this week
- The Autonomous Company Operating System ☆19,571 · Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation ☆19,607 · Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. ☆15,958 · Mar 4, 2026 · Updated last month
- Universal memory layer for AI Agents ☆52,137 · Updated this week
- No fortress, purely open ground. OpenManus is Coming. ☆55,682 · Feb 11, 2026 · Updated 2 months ago
- Mobile-Agent: The Powerful GUI Agent Family ☆8,408 · Mar 31, 2026 · Updated last week
- 🙌 OpenHands: AI-Driven Development ☆70,666 · Updated this week
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use. ☆1,766 · Jan 20, 2026 · Updated 2 months ago
- A programming framework for agentic AI ☆56,900 · Updated this week
- Build Real-Time Knowledge Graphs for AI Agents ☆24,507 · Apr 5, 2026 · Updated last week
- The SDK For Browser Agents ☆21,897 · Updated this week
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org ☆16,619 · Apr 4, 2026 · Updated last week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone ☆24,322 · Apr 1, 2026 · Updated last week
- 🖥️ Run AI Agent in your browser. ☆15,811 · Aug 31, 2025 · Updated 7 months ago
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆59,890 · Apr 4, 2026 · Updated last week
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming ☆66,626 · Jan 21, 2026 · Updated 2 months ago
- An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, s… ☆59,375 · Updated this week
- Build, run, manage agentic software at scale. ☆39,343 · Updated this week
- An open-sourced end-to-end VLM-based GUI Agent ☆1,167 · Apr 4, 2025 · Updated last year
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments ☆2,757 · Apr 2, 2026 · Updated last week
- Production-ready platform for agentic workflow development. ☆137,197 · Updated this week
- Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally. ☆59,774 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆75,637 · Updated this week
- Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full… ☆13,436 · Updated this week
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆18,917 · Jan 30, 2026 · Updated 2 months ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆63,500 · Updated this week
- Automate browser based workflows with AI ☆21,068 · Updated this week
- 🚀 The fast, Pythonic way to build MCP servers and clients. ☆24,408 · Updated this week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t… ☆48,311 · Updated this week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. ☆27,088 · Jan 9, 2026 · Updated 3 months ago
- verl: Volcano Engine Reinforcement Learning for LLMs ☆20,443 · Apr 3, 2026 · Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ☆42,652 · Updated this week
- 🔥 The Web Data API for AI - Power AI agents with clean web data ☆104,217 · Apr 4, 2026 · Updated last week
- AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps. ☆6,654 · Mar 19, 2025 · Updated last year