Pioneering Automated GUI Interaction with Native Agents
☆9,712Jan 27, 2026Updated last month
Alternatives and similar repositories for UI-TARS
Users that are interested in UI-TARS are comparing it to the libraries listed below
Sorting:
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra☆28,399Updated this week
- Driving all platforms UI automation with vision-based model☆11,852Updated this week
- Agent S: an open agentic framework that uses computers like a human☆9,912Feb 21, 2026Updated last week
- A simple screen parsing tool towards pure vision based GUI agent☆24,406Sep 12, 2025Updated 5 months ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆79,028Updated this week
- Kortix – build, manage and train AI Agents.☆19,418Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆13,451Feb 16, 2026Updated 2 weeks ago
- Universal memory layer for AI Agents☆47,994Feb 23, 2026Updated last week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆19,129Updated this week
- No fortress, purely open ground. OpenManus is Coming.☆54,814Feb 11, 2026Updated 2 weeks ago
- 🙌 OpenHands: AI-Driven Development☆68,154Updated this week
- An open-source SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skills and subagents,…☆20,843Updated this week
- A programming framework for agentic AI☆54,956Jan 22, 2026Updated last month
- Build Real-Time Knowledge Graphs for AI Agents☆23,192Updated this week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆23,942Feb 23, 2026Updated last week
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org☆16,104Updated this week
- Mobile-Agent: The Powerful GUI Agent Family☆7,338Updated this week
- Build, run, manage agentic software at scale.☆38,276Updated this week
- The AI Browser Automation Framework☆21,261Updated this week
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming☆64,648Jan 21, 2026Updated last month
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆58,536Updated this week
- 🖥️ Run AI Agent in your browser.☆15,612Aug 31, 2025Updated 6 months ago
- Production-ready platform for agentic workflow development.☆130,750Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆52,724Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆71,234Updated this week
- Automate browser based workflows with AI☆20,530Updated this week
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆18,386Jan 30, 2026Updated last month
- Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full…☆12,761Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆60,971Updated this week
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data☆87,163Updated this week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆44,662Updated this week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆26,713Jan 9, 2026Updated last month
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆37,083Updated this week
- 🚀 The fast, Pythonic way to build MCP servers and clients.☆23,221Updated this week
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆2,608Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆21,026Mar 11, 2025Updated 11 months ago
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,719Jan 20, 2026Updated last month
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆73,900Updated this week
- An open-sourced end-to-end VLM-based GUI Agent☆1,138Apr 4, 2025Updated 10 months ago