TencentQQGYLab / AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
☆5,642Updated last week
Alternatives and similar repositories for AppAgent:
Users that are interested in AppAgent are comparing it to the libraries listed below
- Mobile-Agent: The Powerful Mobile Device Operation Assistant Family☆3,908Updated last week
- A code-first agent framework for seamlessly planning and executing data analytics tasks.☆5,615Updated last week
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,821Updated 2 months ago
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild☆4,194Updated 4 months ago
- An Autonomous LLM Agent for Complex Task Solving☆8,257Updated 7 months ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,230Updated 2 months ago
- A UI-Focused Agent for Windows OS Interaction.☆6,639Updated this week
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.☆4,942Updated 4 months ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,258Updated 5 months ago
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,438Updated 9 months ago
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI☆1,033Updated 3 months ago
- Building a quick conversation-based search demo with Lepton AI.☆8,045Updated 2 weeks ago
- A series of large language models trained from scratch by developers @01-ai☆7,832Updated 4 months ago
- AIOS: AI Agent Operating System☆3,992Updated this week
- ☆9,492Updated 7 months ago
- MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.☆7,275Updated 4 months ago
- GPT4V-level open-source multi-modal model based on Llama3-8B☆2,315Updated 3 weeks ago
- 🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides …☆4,435Updated 6 months ago
- Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins☆2,776Updated last year
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,359Updated this week
- 👾 Open source implementation of the ChatGPT Code Interpreter☆3,832Updated 4 months ago
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆6,316Updated last week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,873Updated 5 months ago
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,140Updated 9 months ago
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st…☆2,042Updated 4 months ago
- The open source platform for AI-native application development.☆5,080Updated 3 months ago
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆6,826Updated last month
- An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation☆833Updated last year
- An LLM-based Web Navigating Agent (KDD'24)☆828Updated 6 months ago
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org☆10,993Updated this week