WilliamQLiu / python-examplesLinks
Simple Python examples including data analysis, ETL, web scraping
☆76Updated 2 years ago
Alternatives and similar repositories for python-examples
Users that are interested in python-examples are comparing it to the libraries listed below
Sorting:
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- Simple alert system implemented in Kafka and Python☆95Updated 6 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 5 years ago
- ☆16Updated 7 years ago
- ETL with Python - Taught at DWH course 2017 (TAU)☆103Updated 7 years ago
- A guide to show you how to import data for ETL☆20Updated 2 years ago
- ☆26Updated 4 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- ☆46Updated 3 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆86Updated 6 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 5 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- A tutorial on streaming data from a Flask REST API and streaming the response into PostgreSQL☆39Updated 5 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Updated last year
- A Series of Notebooks on how to start with Kafka and Python☆154Updated 3 months ago
- Runnable e-commerce mini data warehouse based on Python, PostgreSQL & Metabase, template for new projects☆29Updated 4 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆34Updated 5 years ago
- Course materials for my data pipeline video course with O'Reilly☆198Updated 7 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆51Updated 8 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Updated 3 years ago
- Udacity Data Pipeline Exercises☆15Updated 4 years ago
- Spark and Python (PySpark) Examples☆39Updated 3 years ago
- How to build an awesome data engineering team☆100Updated 5 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 2 years ago
- Teaching notes from my Advanced SQL workshops as local lead instructor at General Assembly New York. The first edition was created for th…☆18Updated 5 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 6 years ago