kadnan / airflow-scraping
Using Apache Airflow to schedule web scrapers
☆42Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for airflow-scraping
- Example of an ETL Pipeline using Airflow☆32Updated 7 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 7 years ago
- Basic tutorial of using Apache Airflow☆35Updated 6 years ago
- ☆109Updated last year
- Data lake, data warehouse on GCP☆54Updated 2 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆85Updated 3 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆85Updated last year
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- ☆46Updated 2 years ago
- Learn to build a data pipeline with Airflow to automate wrangling data - An Udacity Data Engineer Nano Degree Project☆8Updated 5 years ago
- Schedule a data pipeline in Google Cloud using cloud function, BigQuery, cloud storage, cloud scheduler, stack trace, cloud build, and p…☆26Updated 5 years ago
- Sample pytest tests for testing SQL Server assests.☆45Updated 6 years ago
- A quick and easy way to convert a Pandas DataFrame to a Tableau .hyper or .tde extract.☆61Updated 4 years ago
- Repo for building docker based airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializi…☆32Updated 5 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- ETL with Python - Taught at DWH course 2017 (TAU)☆101Updated 7 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆46Updated last year
- Execution of DBT models using Apache Airflow through Docker Compose☆113Updated last year
- E-Commerce Website A/B testing: Recommend which of two landing pages to keep based on A/B testing☆24Updated 6 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆166Updated last year
- scaffold of Apache Airflow executing Docker containers☆85Updated last year
- Use Airflow to move data from multiple MySQL databases to BigQuery☆99Updated 4 years ago
- Analyzing and calculating key marketing metrics with SQL and Python☆14Updated 5 years ago
- Superset Quick Start Guide, published by Packt☆55Updated 8 months ago
- Amazon Redshift Cookbook, Published by Packt☆15Updated last year
- Airflow Tutorials☆24Updated 3 years ago
- Big Data Demystified meetup and blog examples☆31Updated 3 months ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago