hgrif / airflow-tutorial
Airflow basics tutorial
☆397 Updated 3 years ago
Alternatives and similar repositories for airflow-tutorial:
Users who are interested in airflow-tutorial are comparing it to the repositories listed below
- Example DAGs using hooks and operators from Airflow Plugins ☆334 Updated 6 years ago
- ETL best practices with airflow, with examples ☆1,313 Updated 4 months ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR ☆174 Updated last year
- Airflow Unit Tests and Integration Tests ☆256 Updated 2 years ago
- Airflow training for the crunch conf ☆104 Updated 6 years ago
- A docker image and kubernetes config files to run Airflow on Kubernetes ☆654 Updated 5 years ago
- Apache Airflow in Docker Compose (for both versions 1.10.* and 2.*) ☆184 Updated last year
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces ☆325 Updated 4 years ago
- A boilerplate for writing PySpark Jobs ☆396 Updated last year
- ☆197 Updated last year
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆146 Updated 8 years ago
- Guides and docs to help you get up and running with Apache Airflow. ☆804 Updated 2 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow. ☆96 Updated 4 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆166 Updated last year
- A guide to running Airflow on Kubernetes ☆172 Updated 5 years ago
- Apache Airflow tutorial ☆937 Updated 2 years ago
- Use Airflow to move data from multiple MySQL databases to BigQuery ☆99 Updated 4 years ago
- A Getting Started Guide for developing and using Airflow Plugins ☆94 Updated 6 years ago
- Learn the pyspark API through pictures and simple examples ☆169 Updated 4 years ago
- This is a repo documenting the best practices in PySpark. ☆462 Updated 2 years ago
- ☆199 Updated 2 years ago
- Code to build a simple analytics data pipeline with Python ☆102 Updated 7 years ago
- LearningApacheSpark ☆245 Updated last year
- A series of DAGs/Workflows to help maintain the operation of Airflow ☆1,697 Updated 7 months ago
- Create HTML profiling reports from Apache Spark DataFrames ☆195 Updated 4 years ago
- ☆517 Updated 2 years ago
- Apache Spark (PySpark) Practice on Real Data ☆273 Updated 4 years ago
- Notes on Apache Spark (pyspark) ☆296 Updated 5 years ago
- ☆110 Updated last month
- A curated list of data engineering tools for software developers ☆472 Updated 7 years ago