Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
☆22,366 · Mar 16, 2026 · Updated this week
Alternatives and similar repositories for crawlee
Users who are interested in crawlee are comparing it to the libraries listed below.
- 🔥 🔥 🔥 A Free & Self-hostable Airtable Alternative ☆62,480 · Updated this week
- Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API. ☆84,454 · Updated this week
- JavaScript API for Chrome and Firefox ☆93,818 · Updated this week
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data ☆93,251 · Updated this week
- The open-source notification Inbox infrastructure. E-mail, SMS, Push and Slack Integrations. ☆38,699 · Updated this week
- The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications. ☆99,322 · Updated this week
- Deploy headless browsers in Docker. Run on our cloud or bring your own. Free for non-commercial uses. ☆12,774 · Updated this week
- 🧩 The Browser Extension Framework ☆12,922 · Mar 14, 2026 · Updated last week
- Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow… ☆8,612 · Mar 13, 2026 · Updated last week
- Web framework built on Web Standards ☆29,354 · Mar 14, 2026 · Updated last week
- React Flow | Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https… ☆35,661 · Mar 10, 2026 · Updated last week
- ✨ The Next Gen Airtable Alternative: No-Code Postgres ☆21,031 · Updated this week
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in… ☆179,955 · Updated this week
- Trigger.dev – build and deploy fully‑managed AI agents and workflows ☆14,038 · Mar 14, 2026 · Updated last week
- Build smaller, faster, and more secure desktop and mobile applications with a web frontend. ☆104,217 · Updated this week
- A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications. ☆56,581 · Updated this week
- Open Source realtime backend in 1 file ☆56,811 · Updated this week
- 🚀 Strapi is the leading open-source headless CMS. It’s 100% JavaScript/TypeScript, fully customizable, and developer-first. ☆71,663 · Updated this week
- The AI Browser Automation Framework ☆21,583 · Updated this week
- 🧙‍♀️ Move Fast and Break Nothing. End-to-end typesafe APIs made easy. ☆39,782 · Updated this week
- A React Framework for building internal tools, admin panels, dashboards & B2B apps with unmatched flexibility. ☆34,240 · Updated this week
- 🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid sear… ☆10,237 · Feb 13, 2026 · Updated last month
- A set of beautifully-designed, accessible components and a code distribution platform. Works with your favorite frameworks. Open Source. … ☆109,915 · Updated this week
- The headless rich text editor framework for web artisans. ☆35,709 · Updated this week
- Crawl a site to generate knowledge files to create your own custom GPT from a URL ☆22,199 · Jul 7, 2025 · Updated 8 months ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆62,080 · Updated this week
- Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API. ☆39,375 · Updated this week
- The flexible backend for all your projects 🐰 Turn your DB into a headless CMS, admin panels, or apps with a custom UI, instant APIs, aut… ☆34,480 · Mar 13, 2026 · Updated last week
- The modern link attribution platform. Loved by world-class marketing teams like Framer, Perplexity, Superhuman, Twilio, Buffer and more. ☆23,206 · Updated this week
- An open-source, self-hostable PaaS alternative to Vercel, Heroku & Netlify that lets you easily deploy static sites, databases, full-stac… ☆51,710 · Mar 14, 2026 · Updated last week
- Payload is the open-source, fullstack Next.js framework, giving you instant backend superpowers. Get a full TypeScript backend and admin … ☆41,267 · Updated this week
- very good whiteboard infinite canvas SDK ☆45,832 · Updated this week
- The world's most flexible commerce platform. ☆32,359 · Updated this week
- Next-generation ORM for Node.js & TypeScript | PostgreSQL, MySQL, MariaDB, SQL Server, SQLite, MongoDB and CockroachDB ☆45,535 · Mar 13, 2026 · Updated last week
- ToolJet is the open-source foundation of ToolJet AI - the AI-native platform for building internal tools, dashboard, business application… ☆37,623 · Updated this week
- Appwrite® - complete cloud infrastructure for your web, mobile and AI apps. Including Auth, Databases, Storage, Functions, Messaging, Hos… ☆55,149 · Updated this week
- Build AI Agents, Visually ☆50,762 · Mar 14, 2026 · Updated last week
- Build AI Agents the easy way. Supports PostgreSQL, MySQL, MariaDB, MSSQL, MongoDB, Rest API, Docker, K8s, and more 🚀 AI Workflow toolki… ☆27,734 · Updated this week
- The web framework for content-driven websites. ⭐️ Star to support our work! ☆57,552 · Updated this week