bitmakerla / estelaLinks
estela, an elastic web scraping cluster πΈ
β181Updated this week
Alternatives and similar repositories for estela
Users that are interested in estela are comparing it to the libraries listed below
Sorting:
- Page Object pattern for Scrapyβ121Updated last week
- Home of the Ulixee Open Data Platformβ52Updated this week
- Scrapy download handler that can impersonate browser' TLS signatures or JA3 fingerprints.β165Updated 3 weeks ago
- Web scraping Page Objects core libraryβ101Updated last week
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pacβ¦β280Updated 2 weeks ago
- Minimal set of tools to conduct stealthy scraping.β156Updated 2 years ago
- Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of theβ¦β37Updated 10 months ago
- Zyte Automatic Extraction integration for Scrapyβ56Updated 3 years ago
- The Web Scraping Club Free Repositoryβ144Updated 3 weeks ago
- A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteeβ¦β94Updated 2 years ago
- π·οΈ Scrapyd is an application for deploying and running Scrapy spiders.β84Updated 3 weeks ago
- β131Updated last year
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.β70Updated 3 years ago
- Distributed crawling infrastructure running on top of severless computation, cloud storage (such as S3) and sophisticated queues.β430Updated 2 years ago
- Scrapy project boilerplate done rightβ46Updated 3 months ago
- playwright stealthβ689Updated 10 months ago
- Library that helps use puppeteer in scrapy.β52Updated last month
- Extract price amount and currency symbol from a raw text stringβ331Updated 3 months ago
- Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.β233Updated 11 months ago
- Common interface for data container classesβ67Updated 2 months ago
- A test suite of common scraper detection techniques. See how detectable your scraper stack is.β136Updated 2 years ago
- Zyte API integration for Scrapyβ38Updated 2 weeks ago
- Clean, filter and sample URLs to optimize data collection β Python & command-line β Deduplication, spam, content and language filtersβ138Updated 5 months ago
- Patching CDP (Chrome DevTools Protocol) leaks on OS level. Easy to use with Playwright, Selenium, and other web automation tools.β121Updated 9 months ago
- Browser fingerprint data generatorβ63Updated 2 months ago
- πΆ Awesome list of Scrapy tools and librariesβ59Updated 4 years ago
- Super Fast, Super Anti-Detect, and Super Intuitive Web Driverβ67Updated last month
- A blazing-fast Python HTTP Client with TLS fingerprintβ431Updated this week
- π Intelligent browser header & fingerprint generatorβ567Updated 2 months ago
- A suite of tools for protecting the web's open knowledge.β127Updated 8 months ago