apify / crawlee-pythonView on GitHub
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
8,108Updated this week

Alternatives and similar repositories for crawlee-python

Users that are interested in crawlee-python are comparing it to the libraries listed below

Sorting:

Are these results useful?