bytedance / UI-TARSLinks
☆8,171Updated last week
Alternatives and similar repositories for UI-TARS
Users that are interested in UI-TARS are comparing it to the libraries listed below
Sorting:
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra☆19,455Updated this week
- Agent S: an open agentic framework that uses computers like a human☆7,981Updated last week
- A live stream development of RL tunning for LLM agents☆3,591Updated last month
- Your AI Operator for Web, Android, Automation & Testing.☆10,606Updated last week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆18,312Updated last month
- 🖥️ Run AI Agent in your browser.☆15,128Updated 2 months ago
- DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solvin…☆2,858Updated last month
- Task-Aware Agent-driven Prompt Optimization Framework☆3,668Updated 3 weeks ago
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural l…☆3,741Updated 5 months ago
- Out-of-the-box (OOTB) GUI Agent for Windows and macOS☆1,819Updated 5 months ago
- Kortix – build, manage and train AI Agents. Fully Open Source.☆18,550Updated this week
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)☆4,979Updated last month
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,127Updated 4 months ago
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execut…☆17,945Updated this week
- The Open-Source Agentic Workspace for Human-AI Collaboration.☆4,783Updated this week
- A research prototype of a human-centered web agent☆7,922Updated last week
- Build effective agents using Model Context Protocol and simple workflow patterns☆7,717Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆12,220Updated last month
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O…☆11,243Updated this week
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.☆5,353Updated last month
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai☆4,723Updated last week
- II-Agent: a new open-source framework to build and deploy intelligent agents☆2,945Updated 2 months ago
- SOTA search powered LLM☆3,719Updated 7 months ago
- A mini, open-weights, version of our Proxy assistant.☆970Updated 8 months ago
- A simple screen parsing tool towards pure vision based GUI agent☆23,813Updated last month
- 🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.☆4,889Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆72,307Updated this week
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,539Updated 5 months ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆3,236Updated 4 months ago
- An open-sourced end-to-end VLM-based GUI Agent☆1,082Updated 7 months ago