AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
☆6,598Mar 19, 2025Updated last year
Alternatives and similar repositories for AppAgent
Users that are interested in AppAgent are comparing it to the libraries listed below
Sorting:
- Mobile-Agent: The Powerful GUI Agent Family☆8,238Mar 9, 2026Updated last week
- Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users☆625Apr 15, 2025Updated 11 months ago
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,728May 29, 2024Updated last year
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild☆4,728Nov 18, 2024Updated last year
- An Autonomous LLM Agent for Complex Task Solving☆8,517Aug 12, 2024Updated last year
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming☆65,185Jan 21, 2026Updated last month
- The first real AI developer☆33,807Nov 10, 2025Updated 4 months ago
- A programming framework for agentic AI☆55,908Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆24,546Sep 12, 2025Updated 6 months ago
- The first "code-first" agent framework for seamlessly planning and executing data analytics tasks.☆6,128Feb 3, 2026Updated last month
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents☆5,887Sep 26, 2024Updated last year
- A natural language interface for computers☆62,780Feb 9, 2026Updated last month
- Universal memory layer for AI Agents☆50,147Updated this week
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆10,673Dec 4, 2024Updated last year
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,198Jun 19, 2024Updated last year
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,579Mar 13, 2026Updated last week
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆27,405Updated this week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆24,144Mar 7, 2026Updated last week
- Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"☆459Mar 22, 2024Updated last year
- Automate browser based workflows with AI☆20,834Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,578Aug 12, 2024Updated last year
- MS-Agent: a lightweight framework to empower agentic execution of complex tasks☆4,073Updated this week
- ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration☆31,725Updated this week
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆36,085Apr 19, 2025Updated 11 months ago
- An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation☆860Dec 27, 2023Updated 2 years ago
- Pioneering Automated GUI Interaction with Native Agents☆9,928Jan 27, 2026Updated last month
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆46,408Updated this week
- Production-ready platform for agentic workflow development.☆132,828Updated this week
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org☆16,392Updated this week
- AIOS: AI Agent Operating System☆5,324Jan 22, 2026Updated last month
- A framework to enable multimodal models to operate a computer.☆10,189Sep 19, 2025Updated 6 months ago
- Universal LLM Deployment Engine with ML Compilation☆22,246Updated this week
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,483Oct 31, 2023Updated 2 years ago
- Building a quick conversation-based search demo with Lepton AI.☆8,109Dec 2, 2025Updated 3 months ago
- Question and Answer based on Anything.☆13,887Mar 24, 2025Updated 11 months ago
- open-source agentic AI data assistant for the next generation of AI + Data products.☆18,284Updated this week
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆9,504Jun 6, 2025Updated 9 months ago
- ☆8,690Oct 9, 2024Updated last year
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st…☆2,472Nov 7, 2024Updated last year