Pioneering Automated GUI Interaction with Native Agents
☆11,053Jan 27, 2026Updated 5 months ago
Alternatives and similar repositories for UI-TARS
Users that are interested in UI-TARS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra☆37,224Jun 18, 2026Updated last week
- AI-powered, vision-driven UI automation for every platform.☆13,863Updated this week
- Agent S: an open agentic framework that uses computers like a human☆11,928May 13, 2026Updated last month
- A simple screen parsing tool towards pure vision based GUI agent☆24,981Apr 13, 2026Updated 2 months ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆101,585Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The Company AI Command Center☆19,900Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆19,879Jun 23, 2026Updated last week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆16,619Mar 4, 2026Updated 3 months ago
- Universal memory layer for AI Agents☆59,728Updated this week
- No fortress, purely open ground. OpenManus is Coming.☆56,670Feb 11, 2026Updated 4 months ago
- Mobile-Agent: The Powerful GUI Agent Family☆8,892May 14, 2026Updated last month
- 🙌 OpenHands: AI-Driven Development☆78,644Updated this week
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,871Apr 24, 2026Updated 2 months ago
- A programming framework for agentic AI☆59,261Apr 15, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Build Real-Time Knowledge Graphs for AI Agents☆28,071Updated this week
- The SDK For Browser Agents☆23,297Updated this week
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org☆17,305Updated this week
- A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone☆25,717Jun 24, 2026Updated last week
- 🖥️ Run AI Agent in your browser.☆16,141May 15, 2026Updated last month
- Autonomous coding agent as an SDK, IDE extension, or CLI assistant.☆63,963Updated this week
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming☆69,068Jan 21, 2026Updated 5 months ago
- An open-sourced end-to-end VLM-based GUI Agent☆1,184Apr 4, 2025Updated last year
- Build, run, and manage agent platforms.☆40,861Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆2,970Jun 24, 2026Updated last week
- Production-ready platform for agentic workflow development.☆146,705Updated this week
- An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, s…☆75,669Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆84,877Updated this week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.☆67,571Updated this week
- Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full…☆19,035Updated this week
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆19,499Jan 30, 2026Updated 5 months ago
- Automate browser based workflows with AI☆22,041Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆70,185Updated this week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 🚀 The fast, Pythonic way to build MCP servers and clients.☆25,786Jun 24, 2026Updated last week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆27,333Jan 9, 2026Updated 5 months ago
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆54,621Updated this week
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework☆22,173Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆51,475Updated this week
- AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.☆6,788Mar 19, 2025Updated last year
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆452Apr 20, 2025Updated last year