apify / crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
☆17,319Updated this week
Alternatives and similar repositories for crawlee:
Users that are interested in crawlee are comparing it to the libraries listed below
- ✨ The Next Gen Airtable Alternative: No-Code Postgres☆17,188Updated this week
- Open Source realtime backend in 1 file☆44,897Updated last week
- Write components once, run everywhere. Compiles to React, Vue, Qwik, Solid, Angular, Svelte, and more.☆12,953Updated this week
- An open-source & self-hostable Heroku / Netlify / Vercel alternative.☆39,337Updated this week
- The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.☆79,959Updated this week
- 🔥 🔥 🔥 Open Source Airtable Alternative☆53,436Updated this week
- React Flow | Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https…☆28,552Updated this week
- Build system optimized for JavaScript and TypeScript, written in Rust☆27,342Updated this week
- Open-source developer platform to power your entire infra and turn scripts into webhooks, workflows and UIs. Fastest workflow engine (13x…☆12,593Updated this week
- Relocate resource intensive third-party scripts off of the main thread and into a web worker. 🎉☆13,267Updated last month
- Distributed crawler powered by Headless Chrome☆5,560Updated last year
- Amplication brings order to the chaos of large-scale software development by creating Golden Paths for developers - streamlined workflows…☆15,601Updated this week
- Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.☆36,482Updated this week
- Deploy headless browsers in Docker. Run on our cloud or bring your own. Free for non-commercial uses.☆9,756Updated this week
- Build like a team of hundreds_☆47,844Updated this week
- Simple, open source, lightweight (< 1 KB) and privacy-friendly web analytics alternative to Google Analytics.☆21,909Updated this week
- Penpot: The open-source design tool for design and code collaboration☆37,181Updated this week
- A fully featured React components library☆28,197Updated this week
- Shared data types for building collaborative software☆18,840Updated this week
- 🧩 The Browser Extension Framework☆11,446Updated this week
- Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuz…☆22,554Updated this week
- IDE-style autocomplete for your existing terminal & shell☆24,845Updated this week
- A standalone version of the readability lib☆9,688Updated last week
- Puppeteer Pool, run a cluster of instances in parallel☆3,366Updated 11 months ago
- Enlightened library to convert HTML and CSS to SVG☆11,612Updated this week
- Open Source Alternative to Vercel, Netlify and Heroku.☆18,554Updated this week
- Trigger.dev – open source background jobs and AI infrastructure☆10,572Updated this week
- A tool for writing better scripts☆43,846Updated this week
- Create business apps and automate workflows in minutes. Supports PostgreSQL, MySQL, MariaDB, MSSQL, MongoDB, Rest API, Docker, K8s, and m…☆23,738Updated this week
- whiteboard SDK / infinite canvas SDK☆39,391Updated this week