dorianbg / lambda-architecture-demo
Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python
☆24 Updated 4 years ago
Alternatives and similar repositories for lambda-architecture-demo:
Users who are interested in lambda-architecture-demo are comparing it to the repositories listed below
- ☆8 Updated 5 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u… ☆26 Updated 5 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python ☆83 Updated 5 years ago
- Repository used for Spark Trainings ☆53 Updated last year
- Workshop for Spark and Databricks ☆54 Updated 5 years ago
- ☆148 Updated 6 years ago
- PySpark-ETL ☆23 Updated 5 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p… ☆38 Updated 3 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym… ☆60 Updated last year
- Code to build a simple analytics data pipeline with Python ☆102 Updated 7 years ago
- ☆32 Updated 10 months ago
- Blog post on ETL pipelines with Airflow ☆23 Updated 4 years ago
- Capturing model drift and handling its response - Example webinar ☆107 Updated 5 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 Updated 7 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines. ☆46 Updated last year
- Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow and MLFlow' ☆118 Updated last year
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee ☆60 Updated 6 years ago
- Using Kafka-Python to illustrate a ML production pipeline ☆108 Updated 2 years ago
- Notebooks produced throughout Udacity's Data Engineering Nanodegree course ☆73 Updated 4 years ago
- Udacity Data Streaming Nanodegree Program ☆22 Updated 3 years ago
- Dockerize and deploy machine learning model as REST API using Flask ☆77 Updated last year
- Just a boilerplate for PySpark and Flask ☆35 Updated 6 years ago
- ☆19 Updated 6 years ago
- PySpark Cookbook, published by Packt ☆90 Updated last year
- A way for home buyers to know about factors affecting a state ☆47 Updated 5 years ago
- MLFlow Spark Summit 2019 Presentation ☆67 Updated 5 years ago
- This repository hosts a template for calculating ROI on Artificial Intelligence projects ☆44 Updated 5 years ago
- AWS Big Data Certification ☆25 Updated last week
- Slides and code examples for H2O tutorials at various events ☆56 Updated 7 years ago