dorianbg / lambda-architecture-demo
Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python
☆23Updated 5 years ago
Alternatives and similar repositories for lambda-architecture-demo:
Users that are interested in lambda-architecture-demo are comparing it to the libraries listed below
- ☆148Updated 6 years ago
- A way for home buyers to know about factors affecting a state☆47Updated 6 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- MLFlow Spark Summit 2019 Presentation☆67Updated 5 years ago
- Repository used for Spark Trainings☆53Updated last year
- Workshop for Spark and Databricks☆54Updated 5 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆86Updated 5 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- Guide on creating an API for serving your ML model☆65Updated 2 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- code, labs and lectures for the course☆46Updated last year
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 6 years ago
- Deep Learning with Apache Spark and Deep Cognition☆59Updated 6 years ago
- Twitter Sentiment Analysis using Spark and Kafka☆115Updated 5 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- ☆37Updated 8 years ago
- Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.☆20Updated 6 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- How to build an awesome data engineering team☆100Updated 5 years ago
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark☆27Updated 6 years ago
- Making Machine Learning Simple and Scalable with Python, Jupyter Notebook, TensorFlow, Keras, Apache Kafka and KSQL☆95Updated 6 years ago
- Capturing model drift and handling its response - Example webinar☆107Updated 5 years ago
- This repo contains all materials regarding Udacity's data streaming nanodegree☆8Updated 5 years ago
- PyCon SG 2016 - Customer Segmentation in Python☆56Updated 8 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆51Updated 8 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆86Updated 4 years ago
- PySpark Cookbook, published by Packt☆91Updated 2 years ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- ☆16Updated 7 years ago
- Simple alert system implemented in Kafka and Python☆95Updated 6 years ago