Pioneering Automated GUI Interaction with Native Agents
☆10,164Jan 27, 2026Updated 3 months ago
Alternatives and similar repositories for UI-TARS
Users that are interested in UI-TARS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra☆29,564Updated this week
- AI-powered, vision-driven UI automation for every platform.☆12,883Updated this week
- Agent S: an open agentic framework that uses computers like a human☆11,011Feb 21, 2026Updated 2 months ago
- A simple screen parsing tool towards pure vision based GUI agent☆24,694Apr 13, 2026Updated 2 weeks ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆90,877Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The Autonomous Company Operating System☆19,675Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆19,719Apr 17, 2026Updated 2 weeks ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆16,232Mar 4, 2026Updated last month
- Universal memory layer for AI Agents☆54,199Apr 25, 2026Updated last week
- No fortress, purely open ground. OpenManus is Coming.☆55,971Feb 11, 2026Updated 2 months ago
- Mobile-Agent: The Powerful GUI Agent Family☆8,578Apr 14, 2026Updated 2 weeks ago
- 🙌 OpenHands: AI-Driven Development☆72,145Updated this week
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,812Apr 24, 2026Updated last week
- A programming framework for agentic AI☆57,588Apr 15, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Build Real-Time Knowledge Graphs for AI Agents☆25,612Updated this week
- The SDK For Browser Agents☆22,371Updated this week
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org☆16,826Apr 25, 2026Updated last week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆24,460Updated this week
- 🖥️ Run AI Agent in your browser.☆15,905Aug 31, 2025Updated 8 months ago
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆61,275Updated this week
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming☆67,611Jan 21, 2026Updated 3 months ago
- Run agents as production software.☆39,835Updated this week
- An open-sourced end-to-end VLM-based GUI Agent☆1,174Apr 4, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆2,817Apr 25, 2026Updated last week
- An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, s…☆64,092Updated this week
- Production-ready platform for agentic workflow development.☆139,793Updated this week
- Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.☆63,070Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆78,385Updated this week
- Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full…☆15,318Updated this week
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆19,075Jan 30, 2026Updated 3 months ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆64,650Apr 24, 2026Updated last week
- Automate browser based workflows with AI☆21,396Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🚀 The fast, Pythonic way to build MCP servers and clients.☆24,918Updated this week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆27,191Jan 9, 2026Updated 3 months ago
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆50,149Updated this week
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework☆21,046Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆45,153Updated this week
- AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.☆6,694Mar 19, 2025Updated last year
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆444Apr 20, 2025Updated last year