TencentQQGYLab / AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
☆5,481Updated 6 months ago
Alternatives and similar repositories for AppAgent:
Users that are interested in AppAgent are comparing it to the libraries listed below
- Mobile-Agent: The Powerful Mobile Device Operation Assistant Family☆3,426Updated last week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆5,818Updated 3 weeks ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,453Updated 6 months ago
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st…☆1,996Updated 3 months ago
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆6,387Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆17,621Updated this week
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild☆4,136Updated 3 months ago
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆4,232Updated 3 weeks ago
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,353Updated 8 months ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆7,856Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆4,868Updated 3 weeks ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆7,031Updated last month
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆16,871Updated 2 weeks ago
- A unified evaluation framework for large language models☆2,529Updated last week
- official repository of aiXcoder-7B Code Large Language Model☆2,239Updated last month
- ModelScope-Agent: An agent framework connecting models in ModelScope with the world☆2,915Updated last month
- MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.☆7,200Updated 3 months ago
- Build resilient language agents as graphs.☆9,076Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆5,996Updated last month
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,634Updated 9 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,241Updated 9 months ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆40,769Updated this week
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆5,583Updated this week
- GPT4V-level open-source multi-modal model based on Llama3-8B☆2,257Updated 5 months ago
- ChatOllama is an open source chatbot based on LLMs. It supports a wide range of language models, and knowledge base management.☆3,015Updated last week
- Go ahead and axolotl questions☆8,620Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,838Updated 4 months ago
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents☆5,461Updated 4 months ago
- An LLM-based Web Navigating Agent (KDD'24)☆811Updated 4 months ago
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆18,484Updated this week