bitmakerla / estelaLinks
estela, an elastic web scraping cluster πΈ
β189Updated 3 weeks ago
Alternatives and similar repositories for estela
Users that are interested in estela are comparing it to the libraries listed below
Sorting:
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pacβ¦β293Updated 4 months ago
- The Web Scraping Club Free Repositoryβ151Updated 4 months ago
- Scrapy rotation proxy package with advanced functionsβ95Updated 3 years ago
- Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.β236Updated last year
- Home of the Ulixee Open Data Platformβ55Updated 2 weeks ago
- Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of theβ¦β38Updated last year
- Scrapy Extension for monitoring spiders execution.β545Updated 5 months ago
- Distributed crawling infrastructure running on top of severless computation, cloud storage (such as S3) and sophisticated queues.β432Updated 2 years ago
- Library that helps use puppeteer in scrapy.β52Updated last month
- Use AWS Lambda functions as a proxy pool to scrape web pages.β137Updated last year
- Zyte Automatic Extraction integration for Scrapyβ56Updated 3 years ago
- Minimal set of tools to conduct stealthy scraping.β159Updated 2 years ago
- β139Updated last year
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.β70Updated 4 years ago
- playwright stealthβ788Updated last year
- Parsing JavaScript objects into Python data structuresβ212Updated last month
- π Web scraping for humansβ926Updated 9 months ago
- β77Updated 2 months ago
- Scrapyd on container infrastructureβ17Updated 5 months ago
- dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decoratorsβ428Updated 6 months ago
- Scrapy project boilerplate done rightβ48Updated 7 months ago
- Undetected web-scraping & seamless HTML parsing in Python!β293Updated 2 months ago
- Spider templates for automatic crawlers.β31Updated 2 months ago
- Scrapy + Puppeteerβ110Updated 4 years ago
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhereβ132Updated this week
- Get structured JSON data from any page.β178Updated last year
- A complimentary proxy to help to use SPM with headless browsersβ108Updated 2 years ago
- π Intelligent browser header & fingerprint generatorβ733Updated 6 months ago
- Zyte API integration for Scrapyβ38Updated last month
- A fork of https://github.com/AtuboDad/playwright_stealthβ129Updated 3 weeks ago