This repo contains a full-fledged Python-based script that scrapes a JavaScript-rendered website, cleans the data, and pushes the results to a cloud-based database. The workflow is orchestrated on Airflow to run automatically
☆14Oct 2, 2022Updated 3 years ago
Alternatives and similar repositories for python_scrapy_airflow_pipeline
Users that are interested in python_scrapy_airflow_pipeline are comparing it to the libraries listed below
Sorting:
- A collection of theoretical research and analysis of cryptography-based game outcomes in the pursuit of a breakthrough discovery.☆17Oct 29, 2024Updated last year
- Open source project to help the Web3 community fight frauds and scams.☆18Feb 7, 2024Updated 2 years ago
- Format code with uncrustify for Objective-C project easily.☆11Aug 7, 2025Updated 6 months ago
- A wrapper of "pluggy" to support asyncio and context managers