omar-elmaria / python_scrapy_airflow_pipeline

This repo contains a full-fledged Python-based script that scrapes a JavaScript-rendered website, cleans the data, and pushes the results to a cloud-based database. The workflow is orchestrated on Airflow to run automatically
13Updated 2 years ago

Alternatives and similar repositories for python_scrapy_airflow_pipeline:

Users that are interested in python_scrapy_airflow_pipeline are comparing it to the libraries listed below