vergili / bigdata_tutorial
☆16Updated 7 years ago
Alternatives and similar repositories for bigdata_tutorial:
Users that are interested in bigdata_tutorial are comparing it to the libraries listed below
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- Use Airflow to move data from multiple MySQL databases to BigQuery☆100Updated 4 years ago
- Simple alert system implemented in Kafka and Python☆95Updated 6 years ago
- ☆54Updated 6 years ago
- Full stack data engineering tools and infrastructure set-up☆50Updated 4 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆46Updated last year
- Business Data Analysis by HiPIC of CalStateLA☆20Updated 6 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- Udacity Data Pipeline Exercises☆15Updated 4 years ago
- Example of an ETL Pipeline using Airflow☆34Updated 7 years ago
- Big Data Demystified meetup and blog examples☆31Updated 7 months ago
- A Getting Started Guide for developing and using Airflow Plugins☆93Updated 6 years ago
- A tutorial for using Hadoop with Python and Hive☆10Updated 9 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆86Updated 4 years ago
- Code that goes along with https://humansofdata.atlan.com/2018/06/apache-airflow-disease-outbreaks-india/☆24Updated last year
- ☆49Updated 3 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 4 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 5 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆74Updated 2 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆85Updated 2 years ago
- Basic tutorial of using Apache Airflow☆36Updated 6 years ago
- ☆19Updated 4 years ago
- Example DAGs using hooks and operators from Airflow Plugins☆336Updated 6 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 8 years ago
- Code Repository for the EVO-ODAS☆31Updated 7 years ago