AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
☆6,611Mar 19, 2025Updated last year
Alternatives and similar repositories for AppAgent
Users that are interested in AppAgent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mobile-Agent: The Powerful GUI Agent Family☆8,301Updated this week
- Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users☆627Apr 15, 2025Updated 11 months ago
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,734May 29, 2024Updated last year
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild☆4,735Nov 18, 2024Updated last year
- An Autonomous LLM Agent for Complex Task Solving☆8,522Aug 12, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming☆65,749Jan 21, 2026Updated 2 months ago
- The first real AI developer☆33,808Nov 10, 2025Updated 4 months ago
- A programming framework for agentic AI☆56,275Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆24,569Sep 12, 2025Updated 6 months ago
- The first "code-first" agent framework for seamlessly planning and executing data analytics tasks.☆6,132Updated this week
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents☆5,891Sep 26, 2024Updated last year
- A natural language interface for computers☆62,853Feb 9, 2026Updated last month
- Universal memory layer for AI Agents☆50,867Updated this week
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆10,676Dec 4, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,198Jun 19, 2024Updated last year
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,783Updated this week
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆27,498Updated this week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆24,189Mar 7, 2026Updated 3 weeks ago
- Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"☆463Mar 22, 2024Updated 2 years ago
- Automate browser based workflows with AI☆20,936Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,603Aug 12, 2024Updated last year
- MS-Agent: a lightweight framework to empower agentic execution of complex tasks☆4,095Updated this week
- ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration☆31,868Updated this week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆36,136Apr 19, 2025Updated 11 months ago
- An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation☆860Dec 27, 2023Updated 2 years ago
- Pioneering Automated GUI Interaction with Native Agents☆9,987Jan 27, 2026Updated 2 months ago
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆47,139Updated this week
- Production-ready platform for agentic workflow development.☆134,783Updated this week
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org☆16,479Updated this week
- AIOS: AI Agent Operating System☆5,375Jan 22, 2026Updated 2 months ago
- A framework to enable multimodal models to operate a computer.☆10,204Sep 19, 2025Updated 6 months ago
- Universal LLM Deployment Engine with ML Compilation☆22,282Updated this week
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,484Oct 31, 2023Updated 2 years ago
- Building a quick conversation-based search demo with Lepton AI.☆8,108Dec 2, 2025Updated 3 months ago
- Question and Answer based on Anything.☆13,906Mar 24, 2025Updated last year
- open-source agentic AI data assistant for the next generation of AI + Data products.☆18,360Updated this week
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆9,505Jun 6, 2025Updated 9 months ago
- ☆8,688Oct 9, 2024Updated last year
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st…☆2,478Nov 7, 2024Updated last year