nordeim / OmniParser2.0_PyautoguiLinks
Local Deployment of OmniParser v2.0 with pyautogui for True Automated Clicking!
☆36Updated 7 months ago
Alternatives and similar repositories for OmniParser2.0_Pyautogui
Users that are interested in OmniParser2.0_Pyautogui are comparing it to the libraries listed below
Sorting:
- 构建一个前端页面,通过flask框架实现OpenManus的前端调用。☆199Updated 5 months ago
- An common framework for voice and text interactions with LLMs☆95Updated 10 months ago
- mirror of https://huggingface.co/spaces/enzostvs/deepsite☆78Updated 5 months ago
- AutoGen最新架构v0.4正式发布第一个稳定版本,v0.4是对AutoGen的一次从头开始的重写,目的是为构建Agent创建一个更健壮、可扩展、更易用的跨语言库,其应用接口采用分层架构设计,存在多套软件接口用以满足不同的场景需求 。☆110Updated 5 months ago
- 😆 Generate PPT by LLM follow your template. 📢 Not only use llm to generate ppt, but also according to your favorite ppt template. Just…☆92Updated last year
- Sample GLM4V + ChatTTS AI assistant☆85Updated last year
- 本项目基于 [bytedance/deer-flow](https://github.com/bytedance/deer-flow) 二次开发,专为中文用户优化,支持一键部署、SearXNG 集成、SSL 证书等。☆51Updated last month
- 微软开源多Agent智能体协作框架AutoGen全新改版核心概念介绍及相关案例测试☆46Updated 9 months ago
- RTC AIGC Demo☆199Updated last month
- generate ppt with llm☆101Updated last year
- The Python SDK for the Coze API☆407Updated 2 weeks ago
- The KedoAI Process Intelligent Agent Development Platform is a cutting-edge platform designed to simplify the development of intelligent …☆19Updated 3 months ago
- Unsloth框架在Windows平台微调训练Qwen2大模型,非WSL☆61Updated last year
- 世界上最好的MCP Servers的列表,The best mcp servers in the world.☆76Updated 5 months ago
- 支持查询主流agent框架技术文档的MCP server(支持stdio和sse两种传输协议), 支持 langchain、llama-index、autogen、agno、openai-agents-sdk、mcp-doc、camel-ai 和 crew-ai☆136Updated 4 months ago
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆126Updated 5 months ago
- 基于MoneyPrinterTurbo,AI生成分镜大纲与视频(动态,不是念ppt),接入万相通义wan2.1 ai文生视频、图生视频功能,灵活把控视频生成。Based on MoneyPrinterTurbo, AI generates image outline and…☆195Updated 4 months ago
- Ai-To-PPTX Backend PHP >=7.4 + Redis☆110Updated 7 months ago
- 用于提供给本地开发者的 LLM的高效互联网搜索&内容获取的MCP Server, 节省你的token☆113Updated 4 months ago
- 一个用于F5-TTS的api和webui项目☆63Updated 8 months ago
- 使用 FastAPI、Streamlit本地部署ChatTTS文本转语音模型,并通过 Docker Compose 进行容器化部署。☆27Updated 11 months ago
- 数字人开口说话,采用 live2d 数字人模型 + edge-tts (文本语音合成)☆62Updated last year
- 基于Linly-Talker数字人改版的教育系统,包含网课总结、数字人对话、Chatbot对话,项目可在autodl部署☆32Updated last year
- The web UI for LangManus.☆144Updated 6 months ago
- 使用CrewAI+FastAPI搭建多Agent协作应用并对外提供API服务,同时支持gpt、国产大模型、Ollama本地大模型。☆82Updated 11 months ago
- GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)☆107Updated last year
- langchain 工具,流程设计组件,服务,代理以及相关学习文档的合集(agent,service,tutorials,flow-design)☆134Updated last year
- MCP Client as an Agent Strategy Plugin. Support GUI operation via UI-TARS-SDK.☆154Updated 2 months ago
- Python package for doing RPA☆91Updated this week
- Dive into LLM Agents☆19Updated last year