bytedance / UI-TARS
Pioneering Automated GUI Interaction with Native Agents
☆8,758 · Updated last week
Alternatives and similar repositories for UI-TARS
Users that are interested in UI-TARS are comparing it to the libraries listed below
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra ☆20,394 · Updated this week
- A live stream development of RL tuning for LLM agents ☆3,751 · Updated 2 months ago
- Agent S: an open agentic framework that uses computers like a human ☆9,211 · Updated 3 weeks ago
- Kortix – build, manage and train AI Agents. ☆19,002 · Updated this week
- Driving all platforms UI automation with vision-based model ☆11,137 · Updated this week
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆3,283 · Updated 6 months ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. ☆12,863 · Updated 3 months ago
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execut… ☆18,920 · Updated this week
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents. AI-driven local automation assistant that uses natural l… ☆3,791 · Updated 7 months ago
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models. ☆5,367 · Updated 3 months ago
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation ☆18,542 · Updated this week
- Out-of-the-box (OOTB) GUI Agent for Windows and macOS ☆1,854 · Updated 7 months ago
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. ☆7,288 · Updated last month
- Task-Aware Agent-driven Prompt Optimization Framework ☆3,732 · Updated 2 months ago
- 🖥️ Run AI Agent in your browser. ☆15,374 · Updated 4 months ago
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use. ☆1,624 · Updated 7 months ago
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai ☆4,827 · Updated last week
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget) ☆5,043 · Updated 3 weeks ago
- Train your AI self, amplify you, bridge the world ☆14,946 · Updated 3 months ago
- DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solvin… ☆3,029 · Updated 3 months ago
- Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud. ☆14,788 · Updated last month
- An open-sourced end-to-end VLM-based GUI Agent ☆1,113 · Updated 9 months ago
- A research prototype of a human-centered web agent ☆9,544 · Updated 2 weeks ago
- ☆3,468 · Updated 10 months ago
- Tongyi Deep Research, the Leading Open-source Deep Research Agent ☆17,801 · Updated last week
- Toolkit for linearizing PDFs for LLM datasets/training ☆16,582 · Updated this week
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe… ☆3,863 · Updated 6 months ago
- Build Real-Time Knowledge Graphs for AI Agents ☆21,553 · Updated last week
- II-Agent: a new open-source framework to build and deploy intelligent agents ☆3,079 · Updated last month
- ☆10,082 · Updated 4 months ago