mnotgod96 / AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
☆4,859 Updated last month
Related projects:
- Mobile-Agent: The Powerful Mobile Device Operation Assistant Family ☆2,677 Updated 2 weeks ago
- Building a quick conversation-based search demo with Lepton AI. ☆7,706 Updated 2 weeks ago
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild ☆3,900 Updated 2 months ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ☆4,578 Updated last week
- MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone ☆11,907 Updated this week
- Question and Answer based on Anything. ☆11,376 Updated this week
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne… ☆4,793 Updated this week
- A code-first agent framework for seamlessly planning and executing data analytics tasks. ☆5,194 Updated this week
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents ☆5,162 Updated last week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model ☆3,415 Updated last month
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st… ☆1,715 Updated last week
- Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension. ☆3,093 Updated 2 weeks ago
- An Autonomous LLM Agent for Complex Task Solving ☆8,033 Updated last month
- A framework to enable multimodal models to operate a computer. ☆8,590 Updated last month
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆6,363 Updated this week
- A series of large language models trained from scratch by developers @01-ai ☆7,598 Updated last week
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance ☆5,491 Updated last week
- a state-of-the-art-level open visual language model | Multimodal pre-trained model ☆5,871 Updated 3 months ago
- A list of AI autonomous agents ☆9,688 Updated last month
- MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo. ☆6,824 Updated last week
- ☆7,075 Updated last month
- Python SDK, Proxy Server to call 100+ LLM APIs using the OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker,… ☆12,231 Updated this week
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions ☆7,405 Updated 3 weeks ago
- Automate browser-based workflows with LLMs and Computer Vision ☆5,768 Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. ☆17,176 Updated this week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. ☆13,305 Updated 2 weeks ago
- Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring. ☆9,185 Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper ☆4,542 Updated 2 weeks ago
- ☆6,415 Updated last week
- 🧙AutoDev: The AI-powered coding wizard (AI-driven programming assistant) with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying as… ☆2,740 Updated last week