mnotgod96 / AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
☆5,139Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for AppAgent
- Mobile-Agent: The Powerful Mobile Device Operation Assistant Family☆3,000Updated last month
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st…☆1,866Updated 2 weeks ago
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild☆3,996Updated this week
- A list of AI autonomous agents☆11,520Updated this week
- 🧙AutoDev: The AI-powered coding wizard(AI 驱动编程助手)with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying ass…☆2,841Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,660Updated last month
- A Blazing Fast AI Gateway with integrated Guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.☆6,304Updated last week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆3,597Updated last month
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆3,505Updated last month
- ☆9,368Updated 3 months ago
- Large Action Model framework to develop AI Web Agents☆5,477Updated this week
- Building a quick conversation-based search demo with Lepton AI.☆7,842Updated last week
- A framework to enable multimodal models to operate a computer.☆8,868Updated 3 months ago
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents☆5,303Updated last month
- An Autonomous LLM Agent for Complex Task Solving☆8,171Updated 3 months ago
- A code-first agent framework for seamlessly planning and executing data analytics tasks.☆5,355Updated this week
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,096Updated 5 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆13,971Updated this week
- official repository of aiXcoder-7B Code Large Language Model☆2,223Updated 2 months ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆5,185Updated 2 weeks ago
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆9,783Updated this week
- A collection of GPT system prompts and various prompt injection/leaking knowledge.☆8,210Updated 3 weeks ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆6,985Updated this week
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆18,395Updated this week
- Automate browser-based workflows with LLMs and Computer Vision☆10,475Updated this week
- Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model☆3,307Updated 2 weeks ago
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆18,840Updated this week
- ModelScope-Agent: An agent framework connecting models in ModelScope with the world☆2,722Updated last week
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI☆983Updated 2 months ago
- 👾 Open source implementation of the ChatGPT Code Interpreter☆3,794Updated 2 weeks ago