bytedance / UI-TARSLinks
☆8,297Updated 3 weeks ago
Alternatives and similar repositories for UI-TARS
Users that are interested in UI-TARS are comparing it to the libraries listed below
Sorting:
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra☆19,649Updated this week
- Agent S: an open agentic framework that uses computers like a human☆8,390Updated last month
- Your AI Operator for Web, Android, Automation & Testing.☆10,743Updated this week
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural l…☆3,769Updated 6 months ago
- A live stream development of RL tunning for LLM agents☆3,633Updated last month
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)☆5,015Updated last month
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆18,401Updated last week
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execut…☆18,401Updated this week
- 🖥️ Run AI Agent in your browser.☆15,248Updated 3 months ago
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O…☆11,437Updated last week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆12,476Updated 2 months ago
- A research prototype of a human-centered web agent☆8,367Updated this week
- Kortix – build, manage and train AI Agents. Fully Open Source.☆18,674Updated this week
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai☆4,759Updated last week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,693Updated last month
- A simple screen parsing tool towards pure vision based GUI agent☆23,930Updated 2 months ago
- Playwright Model Context Protocol Server - Tool to automate Browsers and APIs in Claude Desktop, Cline, Cursor IDE and More 🔌☆5,019Updated last month
- ☆9,759Updated 3 months ago
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,206Updated last week
- DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solvin…☆2,924Updated 2 months ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆3,250Updated 4 months ago
- Out-of-the-box (OOTB) GUI Agent for Windows and macOS☆1,833Updated 6 months ago
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents☆1,867Updated last month
- 5ire is a cross-platform desktop AI assistant, MCP client. It compatible with major service providers, supports local knowledge base and…☆4,806Updated this week
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,566Updated 6 months ago
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.☆5,361Updated last month
- Fully local web research and report writing assistant☆8,361Updated 3 months ago
- The Open-Source Agentic Workspace for Human-AI Collaboration.☆4,874Updated this week
- Build effective agents using Model Context Protocol and simple workflow patterns☆7,811Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,596Updated 4 months ago