vergili / bigdata_tutorial
☆16 · Updated 6 years ago
Related projects:
- Just a boilerplate for PySpark and Flask ☆35 · Updated 6 years ago
- Code to build a simple analytics data pipeline with Python ☆102 · Updated 7 years ago
- Airflow training for the crunch conf ☆105 · Updated 5 years ago
- Code Repository for the EVO-ODAS ☆31 · Updated 6 years ago
- Blog post on ETL pipelines with Airflow ☆23 · Updated 4 years ago
- Simple alert system implemented in Kafka and Python ☆93 · Updated 6 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines. ☆47 · Updated last year
- ☆48 · Updated 2 years ago
- Sample Airflow DAGs to load data from the CovidTracking API to Snowflake via an AWS S3 intermediary. ☆16 · Updated 3 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u… ☆26 · Updated 5 years ago
- ☆22 · Updated this week
- Use Airflow to move data from multiple MySQL databases to BigQuery ☆99 · Updated 4 years ago
- An example PySpark project with pytest ☆17 · Updated 6 years ago
- Code that goes along with https://humansofdata.atlan.com/2018/06/apache-airflow-disease-outbreaks-india/ ☆24 · Updated last year
- Udacity Data Pipeline Exercises ☆15 · Updated 4 years ago
- Basic tutorial of using Apache Airflow ☆35 · Updated 5 years ago
- 🐍💨 Airflow tutorial for PyCon 2019 ☆85 · Updated last year
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆54 · Updated 5 years ago
- Big Data Demystified meetup and blog examples ☆31 · Updated last month
- ☆54 · Updated 5 years ago
- event-triggered plugins for airflow ☆21 · Updated 4 years ago
- ☆39 · Updated this week
- Example of an ETL Pipeline using Airflow ☆31 · Updated 7 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation. ☆37 · Updated 5 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster ☆22 · Updated last year
- Udacity Data Engineering Nanodegree Projects ☆11 · Updated 5 years ago
- ☆16 · Updated 4 years ago
- Repository used for Spark Trainings ☆53 · Updated last year
- A repository of sample code to show data quality checking best practices using Airflow. ☆71 · Updated last year
- A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course I gave to one of our clients in Dece… ☆10 · Updated 8 years ago