TencentQQGYLab / AppAgentLinks
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
☆5,887Updated 3 months ago
Alternatives and similar repositories for AppAgent
Users that are interested in AppAgent are comparing it to the libraries listed below
Sorting:
- Mobile-Agent: The Powerful Mobile Device Operation Assistant Family☆4,341Updated 2 weeks ago
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild☆4,340Updated 7 months ago
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents☆5,628Updated 8 months ago
- The Desktop AgentOS.☆7,384Updated last week
- 🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides …☆4,623Updated 9 months ago
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st…☆2,112Updated 7 months ago
- A code-first agent framework for seamlessly planning and executing data analytics tasks.☆5,764Updated last month
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,662Updated 9 months ago
- ModelScope-Agent: An agent framework connecting models in ModelScope with the world☆3,192Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,394Updated 5 months ago
- An Autonomous LLM Agent for Complex Task Solving☆8,371Updated 10 months ago
- MiniCPM4: Ultra-Efficient LLMs on End Devices, achieving 5+ speedup on typical end-side chips☆7,951Updated last week
- An LLM-based Web Navigating Agent (KDD'24)☆865Updated 8 months ago
- A framework to enable multimodal models to operate a computer.☆9,715Updated last month
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,589Updated last year
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,683Updated 3 months ago
- An open-sourced end-to-end VLM-based GUI Agent☆970Updated 2 months ago
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,166Updated last year
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI☆1,042Updated 6 months ago
- ☆6,641Updated 4 months ago
- A framework for prompt tuning using Intent-based Prompt Calibration☆2,607Updated 2 months ago
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,835Updated 5 months ago
- [IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.☆1,367Updated last year
- AIOS: AI Agent Operating System☆4,260Updated last week
- Question and Answer based on Anything.☆13,266Updated 2 months ago
- A series of large language models trained from scratch by developers @01-ai☆7,830Updated 6 months ago
- 🧙AutoDev: The AI-powered coding wizard(AI 驱动编程助手)with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying ass…☆3,977Updated this week
- Large World Model -- Modeling Text and Video with Millions Context☆7,293Updated 8 months ago
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆19,629Updated last week
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆753Updated 4 months ago