apify / crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
☆16,521Updated this week
Alternatives and similar repositories for crawlee:
Users that are interested in crawlee are comparing it to the libraries listed below
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in…☆55,087Updated this week
- Trigger.dev is the open source background jobs platform.☆9,912Updated this week
- 🧩 The Browser Extension Framework☆10,931Updated 2 months ago
- ♾ Infisical is the open-source secret management platform: Sync secrets across your team/infrastructure, prevent secret leaks, and manage…☆16,249Updated this week
- Next-Generation full text search library for Browser and Node.js☆12,657Updated 6 months ago
- Deploy headless browsers in Docker. Run on our cloud or bring your own. Free for non-commercial uses.☆9,151Updated this week
- 🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get …☆23,253Updated this week
- Simple, powerful and flexible site generation framework with everything you love from Next.js.☆12,145Updated this week
- Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow…☆5,077Updated this week
- 🧙♀️ Move Fast and Break Nothing. End-to-end typesafe APIs made easy.☆35,567Updated this week
- Web framework built on Web Standards☆21,542Updated this week
- Open-source developer platform to power your entire infra and turn scripts into webhooks, workflows and UIs. Fastest workflow engine (13x…☆11,651Updated this week
- Tiny and powerful JavaScript full-text search engine for browser and Node☆5,008Updated last month
- A standalone version of the readability lib☆9,260Updated this week
- The fastest knowledge base for growing teams. Beautiful, realtime collaborative, feature packed, and markdown compatible.☆29,324Updated this week
- Umami is a simple, fast, privacy-focused alternative to Google Analytics.☆23,753Updated this week
- Scheduling infrastructure for absolutely everyone.☆33,409Updated this week
- Notion-style WYSIWYG editor with AI-powered autocompletion.☆13,586Updated this week
- An open-source & self-hostable Heroku / Netlify / Vercel alternative.☆36,375Updated this week
- Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script.☆15,070Updated 2 years ago
- Package your Node.js project into an executable☆24,337Updated last year
- Enlightened library to convert HTML and CSS to SVG☆11,347Updated last week
- A single API for all your integrations.☆4,922Updated this week
- Open source alternative to Auth0 / Firebase Auth / AWS Cognito☆13,628Updated this week
- Your friendliest open source AI automation tool ✨ Workflow automation tool 200+ integration / Enterprise automation tool / Zapier Alterna…☆10,949Updated this week
- Open source website builder and Webflow alternative. Webstudio is an advanced visual builder that connects to any headless CMS, supports …☆5,766Updated this week
- 🌼 🌼 🌼 🌼 🌼 The most popular, free and open-source Tailwind CSS component library☆34,923Updated this week
- The open source Zapier alternative. Build workflow automation without spending time and money.☆7,574Updated this week
- highlight.io: The open source, full-stack monitoring platform. Error monitoring, session replay, logging, distributed tracing, and more.☆7,859Updated this week
- A fast, local first, reactive Database for JavaScript Applications https://rxdb.info/☆21,804Updated this week