apify / crawleeLinks
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
☆18,462Updated last week
Alternatives and similar repositories for crawlee
Users that are interested in crawlee are comparing it to the libraries listed below
Sorting:
- Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow…☆5,835Updated this week
- Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.☆74,752Updated this week
- Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuz…☆23,918Updated last week
- Deploy headless browsers in Docker. Run on our cloud or bring your own. Free for non-commercial uses.☆10,667Updated this week
- 🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid sear…☆9,549Updated this week
- very good whiteboard SDK / infinite canvas SDK☆40,786Updated this week
- 🔥 🔥 🔥 Open Source Airtable Alternative☆55,723Updated this week
- A React Framework for building internal tools, admin panels, dashboards & B2B apps with unmatched flexibility.☆31,599Updated last week
- Visual Development for React, Vue, Svelte, Qwik, and more☆8,283Updated this week
- 💯 Teach puppeteer new tricks through plugins.☆6,936Updated last year
- Relocate resource intensive third-party scripts off of the main thread and into a web worker. 🎉☆13,404Updated last month
- 🧩 The Browser Extension Framework☆12,065Updated this week
- A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.☆52,374Updated this week
- Trigger.dev – open source background jobs and AI infrastructure☆11,724Updated this week
- Connect APIs, remarkably fast. Free for developers.☆10,177Updated this week
- The AI Browser Automation Framework☆13,956Updated this week
- State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!☆14,027Updated last week
- The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.☆85,769Updated this week
- highlight.io: The open source, full-stack monitoring platform. Error monitoring, session replay, logging, distributed tracing, and more.☆8,441Updated last week
- Instant-loading web apps, without effort☆21,504Updated this week
- 🔥 🔥 🔥 Open Source JIRA, Linear, Monday, and Asana Alternative. Plane helps you track your issues, epics, and cycles the easiest way on…☆37,242Updated this week
- Create business apps and automate workflows in minutes. Supports PostgreSQL, MySQL, MariaDB, MSSQL, MongoDB, Rest API, Docker, K8s, and m…☆24,929Updated this week
- Lexical is an extensible text editor framework that provides excellent reliability, accessibility and performance.☆21,573Updated this week
- Lightpanda: the headless browser designed for AI and automation☆9,353Updated this week
- High performance, self-hosted, newsletter and mailing list manager with a modern dashboard. Single binary app.☆17,342Updated this week
- A docker-powered PaaS that helps you build and manage the lifecycle of applications☆30,819Updated last week
- 🧙♀️ Move Fast and Break Nothing. End-to-end typesafe APIs made easy.☆37,836Updated last week
- The headless Chrome/Chromium driver on top of Puppeteer.☆1,727Updated last month
- ⚡️ The Missing Fullstack Toolkit for Next.js☆13,943Updated 4 months ago
- 🚀🎉📚 APITable, an API-oriented low-code platform for building collaborative apps and better than all other Airtable open-source alterna…☆14,684Updated 2 months ago