web-infra-dev / midscene
Let AI be your browser operator.
☆5,229Updated this week
Alternatives and similar repositories for midscene:
Users that are interested in midscene are comparing it to the libraries listed below
- QA via natural language AI tests☆4,156Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework☆2,430Updated 2 weeks ago
- Run AI Agent in your browser.☆4,447Updated this week
- library & platform to build, distribute, monetize ai apps that have the full context (like rewind, granola, etc.), open source, 100% loca…☆11,874Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆4,307Updated this week
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi…☆3,289Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆5,587Updated 3 weeks ago
- ☆3,345Updated 2 months ago
- A free + OSS logo generator powered by Flux on Together AI☆2,750Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆5,180Updated last week
- Open-Source No-Code Web Data Extraction Platform. Turn Websites To APIs & Spreadsheets With No-Code Robots In Minutes.☆8,785Updated this week
- Lightpanda: the headless browser designed for AI and automation☆4,222Updated this week
- A fast multimodal LLM for real-time voice☆3,245Updated last week
- Build real-time multimodal AI applications 🤖🎙️📹☆4,891Updated this week
- This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.☆4,753Updated this week
- Find the best cursor rules for your framework and language☆2,305Updated last week
- Automate browser-based workflows with LLMs and Computer Vision☆11,800Updated this week
- Stay on top of trending topics on social media and the web with AI☆2,527Updated this week
- 🤖 Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, repo…☆5,344Updated this week
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆4,740Updated last week
- TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time c…☆4,034Updated this week
- Roo Code (prev. Roo Cline) is a VS Code plugin that enhances coding with AI-powered automation, multi-model support, and experimental fea…☆4,272Updated this week
- An AI agent that writes (actually useful) code for you☆3,638Updated 2 months ago
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations☆3,917Updated last week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆23,008Updated this week
- ☆2,980Updated this week
- 📄 A curated list of awesome .cursorrules files☆8,453Updated this week
- The open-source AI-native IDE☆1,648Updated this week