vergili / bigdata_tutorialLinks
☆16Updated 7 years ago
Alternatives and similar repositories for bigdata_tutorial
Users that are interested in bigdata_tutorial are comparing it to the libraries listed below
Sorting:
- Blog post on ETL pipelines with Airflow☆23Updated 5 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- Code Repository for the EVO-ODAS☆32Updated 7 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Business Data Analysis by HiPIC of CalStateLA☆20Updated 6 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆86Updated 2 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- Use Airflow to move data from multiple MySQL databases to BigQuery☆100Updated 4 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- ☆49Updated 3 years ago
- Material for Talk Python Training course on Getting Started with Dask.☆28Updated 2 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Big Data Demystified meetup and blog examples☆31Updated 10 months ago
- ☆16Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- Udacity Data Pipeline Exercises☆15Updated 5 years ago
- ☆19Updated 4 years ago
- An example PySpark project with pytest☆16Updated 7 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆87Updated 6 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 5 years ago
- event-triggered plugins for airflow☆21Updated 5 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- Sample pytest tests for testing SQL Server assests.☆46Updated 6 years ago
- Example of an ETL Pipeline using Airflow☆35Updated 7 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model.☆18Updated 6 years ago
- Repository used for Spark Trainings☆53Updated 2 years ago
- Amazon Redshift Cookbook, Published by Packt☆15Updated 2 years ago