bytedance / UI-TARS
Pioneering Automated GUI Interaction with Native Agents
☆9,343 Updated 2 weeks ago
Alternatives and similar repositories for UI-TARS
Users who are interested in UI-TARS are comparing it to the libraries listed below.
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra ☆27,325 Updated 3 weeks ago
- Agent S: an open agentic framework that uses computers like a human ☆9,713 Updated 3 weeks ago
- Driving all platforms UI automation with vision-based model ☆11,647 Updated this week
- A live stream development of RL tuning for LLM agents ☆3,901 Updated 4 months ago
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation ☆19,034 Updated this week
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O… ☆12,162 Updated 2 months ago
- A research prototype of a human-centered web agent ☆9,632 Updated 2 weeks ago
- Kortix – build, manage and train AI Agents. ☆19,325 Updated this week
- 🖥️ Run AI Agent in your browser. ☆15,562 Updated 5 months ago
- A simple screen parsing tool towards pure vision based GUI agent ☆24,344 Updated 4 months ago
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai ☆4,856 Updated 3 weeks ago
- Like Manus, Computer Use Agent (CUA) and Omniparser, we are computer-using agents. AI-driven local automation assistant that uses natural l… ☆3,824 Updated 8 months ago
- DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solvin… ☆3,102 Updated 4 months ago
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execut… ☆19,658 Updated this week
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models. ☆5,370 Updated 4 months ago
- ⚙️ Create and run workflows (RPA 2.0) ☆3,876 Updated last week
- Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containeriz… ☆10,379 Updated 4 months ago
- Playwright Model Context Protocol Server - Tool to automate Browsers and APIs in Claude Desktop, Cline, Cursor IDE and More 🔌 ☆5,210 Updated last month
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. ☆7,563 Updated 2 months ago
- A mini, open-weights, version of our Proxy assistant. ☆984 Updated 11 months ago
- An open-sourced end-to-end VLM-based GUI Agent ☆1,131 Updated 10 months ago
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use. ☆1,697 Updated 3 weeks ago
- Allow LLMs to control a browser with Browserbase and Stagehand ☆3,112 Updated 2 weeks ago
- Out-of-the-box (OOTB) GUI Agent for Windows and macOS ☆1,883 Updated 8 months ago
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. ☆3,075 Updated 7 months ago
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget) ☆5,076 Updated last month
- Fully local web research and report writing assistant ☆8,494 Updated 6 months ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆3,328 Updated 7 months ago
- Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, a… ☆4,640 Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework ☆3,753 Updated 3 months ago