dorianbg / lambda-architecture-demo
Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python
☆24 Updated 4 years ago
Related projects:
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆54 Updated 5 years ago
- Workshop for Spark and Databricks ☆54 Updated 4 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines. ☆47 Updated last year
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python ☆81 Updated 5 years ago
- code, labs and lectures for the course ☆44 Updated last year
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u… ☆26 Updated 5 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 Updated 6 years ago
- Code to build a simple analytics data pipeline with Python ☆102 Updated 7 years ago
- Repository used for Spark Trainings ☆53 Updated last year
- Deep Learning with Apache Spark and Deep Cognition ☆58 Updated 6 years ago
- Just a boilerplate for PySpark and Flask ☆35 Updated 6 years ago
- MLFlow Spark Summit 2019 Presentation ☆67 Updated 5 years ago
- ☆8 Updated 4 years ago
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow. ☆52 Updated last year
- AWS Big Data Certification ☆24 Updated last year
- Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow and MLFlow' ☆120 Updated last year
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark ☆27 Updated 6 years ago
- Guide on creating an API for serving your ML model ☆65 Updated 2 years ago
- Airflow training for the crunch conf ☆105 Updated 5 years ago
- Udacity Data Engineering Nanodegree Projects ☆11 Updated 5 years ago
- ☆32 Updated 6 months ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee ☆59 Updated 6 years ago
- Contains source files used in the Spark with Python course ☆18 Updated 5 years ago
- O'Reilly Katacoda ☆55 Updated last year
- ☆37 Updated 7 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p… ☆37 Updated 3 years ago
- ☆16 Updated last year
- Processing tweets using Spark Streaming and identifying top trending hashtags using a real-time simple dashboard ☆42 Updated 2 years ago
- Udacity Data Streaming Nanodegree Program ☆22 Updated 3 years ago
- Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks ☆35 Updated 3 years ago