apify / crawleeLinks
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
☆21,219Updated this week
Alternatives and similar repositories for crawlee
Users that are interested in crawlee are comparing it to the libraries listed below
Sorting:
- Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow…☆7,583Updated this week
- The fastest knowledge base for growing teams. Beautiful, realtime collaborative, feature packed, and markdown compatible.☆36,841Updated this week
- State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!☆15,233Updated this week
- The world's most flexible commerce platform.☆31,808Updated this week
- The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.☆96,722Updated this week
- An extremely fast bundler for the web☆39,639Updated last month
- Next-generation full-text search library for Browser and Node.js☆13,553Updated 4 months ago
- Session replay, cobrowsing and product analytics you can self-host. Best for reproducing issues and iterating on your product.☆11,680Updated this week
- Build smaller, faster, and more secure desktop and mobile applications with a web frontend.☆101,684Updated this week
- ✨ The Next Gen Airtable Alternative: No-Code Postgres☆20,757Updated this week
- Build system optimized for JavaScript and TypeScript, written in Rust☆29,630Updated this week
- Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.☆38,940Updated last week
- The headless Chrome/Chromium driver on top of Puppeteer.☆1,768Updated this week
- BullMQ - Message Queue and Batch processing for NodeJS, Python, Elixir and PHP based on Redis☆8,290Updated this week
- A fast, local first, reactive Database for JavaScript Applications https://rxdb.info/☆22,993Updated this week
- A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.☆55,559Updated this week
- Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.☆81,562Updated this week
- The flexible backend for all your projects 🐰 Turn your DB into a headless CMS, admin panels, or apps with a custom UI, instant APIs, aut…☆34,064Updated this week
- 🔥 🔥 🔥 A Free & Self-hostable Airtable Alternative☆61,602Updated this week
- 🧩 The Browser Extension Framework☆12,805Updated this week
- The headless rich text editor framework for web artisans.☆34,711Updated this week
- Deploy headless browsers in Docker. Run on our cloud or bring your own. Free for non-commercial uses.☆12,272Updated this week
- Next-generation ORM for Node.js & TypeScript | PostgreSQL, MySQL, MariaDB, SQL Server, SQLite, MongoDB and CockroachDB☆45,106Updated this week
- End-to-end realtime stack for connecting humans and AI☆16,708Updated this week
- ZincSearch . A lightweight alternative to elasticsearch that requires minimal resources, written in Go.☆17,711Updated this week
- 🌐 Human-friendly and powerful HTTP request library for Node.js☆14,863Updated 3 weeks ago
- Open Source realtime backend in 1 file☆55,529Updated this week
- Write components once, run everywhere. Compiles to React, Vue, Qwik, Solid, Angular, Svelte, and more.☆13,690Updated last week
- Open-source developer platform to power your entire infra and turn scripts into webhooks, workflows and UIs. Fastest workflow engine (13x…☆15,637Updated this week
- Instant-loading web apps, without effort☆21,888Updated this week