vergili / bigdata_tutorialLinks
ā16Updated 7 years ago
Alternatives and similar repositories for bigdata_tutorial
Users that are interested in bigdata_tutorial are comparing it to the libraries listed below
Sorting:
- Code to build a simple analytics data pipeline with Pythonā102Updated 8 years ago
- ššØ Airflow tutorial for PyCon 2019ā85Updated 2 years ago
- Simple alert system implemented in Kafka and Pythonā96Updated 7 years ago
- Course materials for my data pipeline video course with O'Reillyā201Updated 7 years ago
- Just a boilerplate for PySpark and Flaskā35Updated 7 years ago
- Airflow training for the crunch confā105Updated 6 years ago
- scaffold of Apache Airflow executing Docker containersā86Updated 2 years ago
- Airflow basics tutorialā397Updated 4 years ago
- Blog post on ETL pipelines with Airflowā24Updated 2 weeks ago
- Code that goes along with https://humansofdata.atlan.com/2018/06/apache-airflow-disease-outbreaks-india/ā24Updated 2 years ago
- šØ Simple, self-contained fraud detection system built with Apache Kafka and Pythonā88Updated 6 years ago
- (project & tutorial) dag pipeline tests + ci/cd setupā88Updated 4 years ago
- Udacity Data Pipeline Exercisesā15Updated 5 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggleā33Updated 9 years ago
- Use Airflow to move data from multiple MySQL databases to BigQueryā100Updated 5 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.ā47Updated 2 years ago
- Code supporting Data Science articles at The Marketing Technologist, Floryn Tech Blog, and Pythom.nlā71Updated 2 years ago
- Code Repository for the EVO-ODASā32Updated 7 years ago
- A Getting Started Guide for developing and using Airflow Pluginsā93Updated 6 years ago
- Example of an ETL Pipeline using Airflowā36Updated 8 years ago
- Basic tutorial of using Apache Airflowā36Updated 6 years ago
- How to build an awesome data engineering teamā100Updated 6 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online courseā18Updated 4 years ago
- Code, slides, and documentation for the talks I have given.ā113Updated 3 months ago
- Udacity Data Engineering Nanodegree Projectsā11Updated 6 years ago
- Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Pythonā23Updated 5 years ago
- ā87Updated 2 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill uā¦ā27Updated 6 years ago
- ā54Updated 6 years ago
- Material for Talk Python Training course on Getting Started with Dask.ā28Updated 2 years ago