bitmakerla / estelaLinks
estela, an elastic web scraping cluster 🕸
☆184Updated 3 weeks ago
Alternatives and similar repositories for estela
Users that are interested in estela are comparing it to the libraries listed below
Sorting:
- The Web Scraping Club Free Repository☆145Updated last month
- Scrapy rotation proxy package with advanced functions☆95Updated 2 years ago
- Home of the Ulixee Open Data Platform☆55Updated 3 weeks ago
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆284Updated last month
- ☆74Updated 4 months ago
- Minimal set of tools to conduct stealthy scraping.☆156Updated 2 years ago
- Distributed crawling infrastructure running on top of severless computation, cloud storage (such as S3) and sophisticated queues.☆430Updated 2 years ago
- Zyte Automatic Extraction integration for Scrapy☆56Updated 3 years ago
- Scrapy Extension for monitoring spiders execution.☆542Updated 2 months ago
- Page Object pattern for Scrapy☆123Updated 3 weeks ago
- Scrapy download handler that can impersonate browser' TLS signatures or JA3 fingerprints.☆172Updated last month
- 🎭 Intelligent browser header & fingerprint generator☆601Updated 3 months ago
- Patching CDP (Chrome DevTools Protocol) leaks on OS level. Easy to use with Playwright, Selenium, and other web automation tools.☆125Updated 10 months ago
- Get structured JSON data from any page.☆176Updated last year
- ☆133Updated last year
- Web scraping Page Objects core library☆101Updated 2 weeks ago
- A test suite of common scraper detection techniques. See how detectable your scraper stack is.☆138Updated 2 years ago
- 🕷️ Scrapyd is an application for deploying and running Scrapy spiders.☆85Updated last month
- 🚀 Web scraping for humans☆897Updated 6 months ago
- Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.☆233Updated last year
- playwright stealth☆707Updated 10 months ago
- A suite of tools for protecting the web's open knowledge.☆128Updated 9 months ago
- Camoufox Integration For Scrapy☆3Updated 5 months ago
- Undetectable browser automation in Docker using Python/Zendriver. Full VNC debugging support.☆37Updated last month
- A blazing-fast Python HTTP Client with TLS fingerprint☆501Updated this week
- Use AWS Lambda functions as a proxy pool to scrape web pages.☆133Updated last year
- Common interface for data container classes☆68Updated 3 months ago
- Detect and classify pagination links☆103Updated 4 years ago
- Super Fast, Super Anti-Detect, and Super Intuitive Web Driver☆70Updated last week
- A drop-in replacement for playwright-python patched with rebrowser-patches. It allows to pass modern automation detection tests.☆75Updated last month