apify/crawlee

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/apify/crawlee)

apify / crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

☆24,722

Alternatives and similar repositories for crawlee

Users that are interested in crawlee are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nocodb / nocodb
View on GitHub
🔥 🔥 🔥 A Free & Self-hostable Airtable Alternative
☆64,010Updated this week
microsoft / playwright
View on GitHub
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
☆92,901Updated this week
puppeteer / puppeteer
View on GitHub
JavaScript API for Chrome and Firefox
☆95,450Updated this week
novuhq / novu
View on GitHub
The open-source communication infrastructure for agents and products
☆39,324Updated this week
browserless / browserless
View on GitHub
Deploy headless browsers in Docker. Run on our cloud or bring your own. Free for non-commercial uses.
☆13,476Updated this week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
PlasmoHQ / plasmo
View on GitHub
🧩 The Browser Extension Framework
☆13,106Updated this week
honojs / hono
View on GitHub
Web framework built on Web Standards
☆31,361Updated this week
apify / crawlee-python
View on GitHub
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow…
☆9,318Updated this week
xyflow / xyflow
View on GitHub
React Flow | Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https…
☆37,650Updated this week
triggerdotdev / trigger.dev
View on GitHub
Trigger.dev – build and deploy fully‑managed AI agents and workflows
☆15,655Updated this week
teableio / teable
View on GitHub
✨ The Next Gen Airtable Alternative: No-Code Postgres
☆21,498Updated this week
tauri-apps / tauri
View on GitHub
Build smaller, faster, and more secure desktop and mobile applications with a web frontend.
☆109,079Updated this week
meilisearch / meilisearch
View on GitHub
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
☆58,603Updated this week
strapi / strapi
View on GitHub
🚀 Strapi is the leading open-source headless CMS. It’s 100% JavaScript/TypeScript, fully customizable, and developer-first.
☆72,664Updated this week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
pocketbase / pocketbase
View on GitHub
Open Source realtime backend in 1 file
☆59,713Updated this week
refinedev / refine
View on GitHub
A React Framework for building internal tools, admin panels, dashboards & B2B apps with unmatched flexibility.
☆35,301Jun 5, 2026Updated last month
trpc / trpc
View on GitHub
🧙‍♀️ Move Fast and Break Nothing. End-to-end typesafe APIs made easy.
☆40,433Jul 9, 2026Updated last week
browserbase / stagehand
View on GitHub
The SDK For Browser Agents
☆23,516Updated this week
shadcn-ui / ui
View on GitHub
A set of beautifully-designed, accessible components and a code distribution platform. Works with your favorite frameworks. Open Source. …
☆119,177Updated this week
oramasearch / orama
View on GitHub
🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid sear…
☆10,486Jul 3, 2026Updated 2 weeks ago
unclecode / crawl4ai
View on GitHub
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
☆72,770Updated this week
ueberdosis / tiptap
View on GitHub
The headless rich text editor framework for web artisans.
☆37,627Updated this week
BuilderIO / gpt-crawler
View on GitHub
Crawl a site to generate knowledge files to create your own custom GPT from a URL
☆22,260Jul 7, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
payloadcms / payload
View on GitHub
Payload is the open-source, fullstack Next.js framework, giving you instant backend superpowers. Get a full TypeScript backend and admin …
☆43,599Updated this week
appsmithorg / appsmith
View on GitHub
Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.
☆40,337Updated this week
directus / directus
View on GitHub
The flexible backend for all your projects 🐰 Turn your DB into a headless CMS, admin panels, or apps with a custom UI, instant APIs, aut…
☆36,486Updated this week
dubinc / dub
View on GitHub
The modern link attribution platform. Loved by world-class marketing teams like Framer, Perplexity, Superhuman, Twilio, Buffer and more.
☆24,032Updated this week
tldraw / tldraw
View on GitHub
Build infinite canvas apps in React with the tldraw SDK. World's best, top-most agent recommended #1 five star SDK.
☆48,781Updated this week
coollabsio / coolify
View on GitHub
An open-source, self-hostable PaaS alternative to Vercel, Heroku & Netlify that lets you easily deploy static sites, databases, full-stac…
☆58,593Updated this week
medusajs / medusa
View on GitHub
The world's most flexible commerce platform.
☆35,176Updated this week
prisma / prisma
View on GitHub
Next-generation ORM for Node.js & TypeScript | PostgreSQL, MySQL, MariaDB, SQL Server, SQLite, MongoDB and CockroachDB
☆47,363Updated this week
appwrite / appwrite
View on GitHub
Appwrite® - complete cloud infrastructure for your web, mobile and AI apps. Including Auth, Databases, Storage, Functions, Messaging, Hos…
☆56,594Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ToolJet / ToolJet
View on GitHub
ToolJet is the open-source foundation of ToolJet AI - the enterprise app generation platform for building internal tools, dashboard, busi…
☆38,211Updated this week
withastro / astro
View on GitHub
The web framework for content-driven websites. ⭐️ Star to support our work!
☆61,044Updated this week
FlowiseAI / Flowise
View on GitHub
Build AI Agents, Visually
☆54,656Updated this week
Budibase / budibase
View on GitHub
AI agents, automations and apps that run your operations. Model agnostic.
☆28,120Updated this week
PostHog / posthog
View on GitHub
🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session repla…
☆35,550Updated this week
makeplane / plane
View on GitHub
🔥🔥🔥 Open-source Jira, Linear, Monday, and ClickUp alternative. Plane is a modern project management platform to manage tasks, sprints,…
☆54,518Updated this week
umami-software / umami
View on GitHub
Umami is a modern, privacy-focused analytics platform. An open-source alternative to Google Analytics, Mixpanel and Amplitude.
☆37,683Updated this week