web-infra-dev / midsceneLinks
Driving all platforms UI automation with vision-based model
☆11,076Updated this week
Alternatives and similar repositories for midscene
Users that are interested in midscene are comparing it to the libraries listed below
Sorting:
- Pioneering Automated GUI Interaction with Native Agents☆8,710Updated last week
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra☆20,269Updated last week
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural l…☆3,791Updated 7 months ago
- 🖥️ Run AI Agent in your browser.☆15,374Updated 4 months ago
- Playwright Model Context Protocol Server - Tool to automate Browsers and APIs in Claude Desktop, Cline, Cursor IDE and More 🔌☆5,093Updated 2 weeks ago
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O…☆11,726Updated last month
- Vibe Workflow Platform for Non-technical Creators.☆5,832Updated this week
- Kortix – build, manage and train AI Agents.☆18,902Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆24,102Updated 3 months ago
- The AI Browser Automation Framework☆19,757Updated this week
- 🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.☆5,136Updated last week
- Craft AI-driven interface effortlessly🤖☆4,144Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆74,239Updated this week
- FlowGram is an extensible workflow development framework with built-in canvas, form, variable, and materials that helps developers build …☆7,475Updated last week
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.☆5,365Updated 2 months ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,635Updated 5 months ago
- Agent S: an open agentic framework that uses computers like a human☆9,211Updated 2 weeks ago
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai☆4,811Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,723Updated 2 months ago
- ⚙️ Create and run workflows (RPA 2.0)☆3,835Updated last week
- Toolkit for linearizing PDFs for LLM datasets/training☆16,483Updated last week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,276Updated last month
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit…☆6,144Updated last week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆16,318Updated 3 weeks ago
- DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execut…☆18,846Updated this week
- QA via natural language AI tests☆5,458Updated 4 months ago
- 5ire is a cross-platform desktop AI assistant, MCP client. It compatible with major service providers, supports local knowledge base and…☆4,887Updated last week
- Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI as…☆9,701Updated this week
- ☆3,508Updated last year
- Lightpanda: the headless browser designed for AI and automation☆11,222Updated this week