microsoft / UFO
A UI-Focused Agent for Windows OS Interaction.
☆6,504 Updated this week
Alternatives and similar repositories for UFO:
Users who are interested in UFO are comparing it to the libraries listed below.
- The open source platform for AI-native application development. ☆5,063 Updated last month
- Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language m… ☆4,247 Updated last week
- BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI… ☆7,497 Updated this week
- Your Automatic Prompt Engineering Assistant for GenAI Applications ☆2,082 Updated 8 months ago
- "LightRAG: Simple and Fast Retrieval-Augmented Generation" ☆12,800 Updated this week
- Developer AI Persona Search Agent ☆1,766 Updated this week
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance ☆4,117 Updated 6 months ago
- Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation ☆3,419 Updated last month
- Mobile-Agent: The Powerful Mobile Device Operation Assistant Family ☆3,180 Updated 3 months ago
- 🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides … ☆4,292 Updated 4 months ago
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning. ☆4,848 Updated last month
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation ☆8,119 Updated 4 months ago
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts" ☆796 Updated this week
- MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo. ☆7,066 Updated 2 months ago
- Multi agent system for AI-driven software development. Combine LLM with DevOps tools to convert natural language requirements into workin… ☆5,839 Updated 5 months ago
- A code-first agent framework for seamlessly planning and executing data analytics tasks. ☆5,458 Updated this week
- An intelligent assistant serving the entire software development lifecycle, powered by a Multi-Agent Framework, working with DevOps Toolk… ☆1,087 Updated 6 months ago
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension. ☆5,320 Updated this week
- A tutorial based on MetaGPT to quickly help you understand the concept of agent and multi-agent and get started with coding development. 基… ☆1,092 Updated 8 months ago
- SDG is a specialized framework designed to generate high-quality structured tabular data. ☆2,280 Updated this week
- Build multimodal language agents for very fast prototype and production ☆1,198 Updated this week
- 【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models ☆1,683 Updated 2 weeks ago
- Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval. ☆1,987 Updated this week
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild ☆4,087 Updated last month
- MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators ☆1,244 Updated last week
- 🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing imp… ☆3,215 Updated 10 months ago
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st… ☆1,966 Updated 2 months ago
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone ☆13,445 Updated this week
- An Autonomous LLM Agent for Complex Task Solving ☆8,107 Updated 5 months ago
- An MBTI Exploration of Large Language Models ☆447 Updated 11 months ago