bitmakerla / estelaLinks
estela, an elastic web scraping cluster πΈ
β192Updated last week
Alternatives and similar repositories for estela
Users that are interested in estela are comparing it to the libraries listed below
Sorting:
- The Web Scraping Club Free Repositoryβ155Updated last month
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pacβ¦β297Updated 6 months ago
- Scrapy rotation proxy package with advanced functionsβ95Updated 3 years ago
- Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of theβ¦β39Updated last year
- Page Object pattern for Scrapyβ124Updated last month
- Use AWS Lambda functions as a proxy pool to scrape web pages.β139Updated last year
- Scrapy Extension for monitoring spiders execution.β549Updated 8 months ago
- Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.β236Updated last year
- Library that helps use puppeteer in scrapy.β52Updated 4 months ago
- Zyte Automatic Extraction integration for Scrapyβ56Updated 3 years ago
- Minimal set of tools to conduct stealthy scraping.β162Updated 2 years ago
- β143Updated 2 years ago
- Home of the Ulixee Open Data Platformβ56Updated 3 months ago
- Distributed crawling infrastructure running on top of severless computation, cloud storage (such as S3) and sophisticated queues.β435Updated 2 years ago
- playwright stealthβ841Updated last year
- Web scraping Page Objects core libraryβ103Updated 2 weeks ago
- β78Updated 5 months ago
- Spider templates for automatic crawlers.β32Updated this week
- dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decoratorsβ428Updated 8 months ago
- A python based HTML to text conversion library, command line client and Web service.β328Updated 3 weeks ago
- Parsing JavaScript objects into Python data structuresβ216Updated 4 months ago
- π Web scraping for humansβ972Updated last year
- Scrapy download handler that can impersonate browser' TLS signatures or JA3 fingerprints.β210Updated 4 months ago
- Comprehensive wrapper and execution manager for the Chrome browser using the Chrome Debugging Protocol.β228Updated 6 months ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.β70Updated 4 years ago
- Detect and classify pagination linksβ104Updated 2 months ago
- Get structured JSON data from any page.β178Updated 2 years ago
- π Intelligent browser header & fingerprint generatorβ863Updated 8 months ago
- Scrapyd on container infrastructureβ16Updated 8 months ago
- Make sense of it all. Semantic data modeling and analytics with a sprinkle of AI. https://totalhack.github.io/zillion/β204Updated 6 months ago