dorianbg / lambda-architecture-demo
Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python
☆23 Updated 5 years ago
Alternatives and similar repositories for lambda-architecture-demo
Users who are interested in lambda-architecture-demo are comparing it to the repositories listed below
- Repository used for Spark Trainings ☆53 Updated 2 years ago
- Workshop for Spark and Databricks ☆54 Updated 5 years ago
- code, labs and lectures for the course ☆47 Updated 2 years ago
- ☆150 Updated 7 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 Updated 6 years ago
- ☆33 Updated last year
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u… ☆26 Updated 5 years ago
- Code to build a simple analytics data pipeline with Python ☆102 Updated 8 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines. ☆47 Updated last year
- MLFlow Spark Summit 2019 Presentation ☆67 Updated 6 years ago
- Insight Data Engineering Project ☆15 Updated 4 years ago
- Airflow training for the crunch conf ☆105 Updated 6 years ago
- Udacity Data Engineering Nanodegree Projects ☆11 Updated 5 years ago
- Data Science Quick Tips Repository! ☆47 Updated last year
- Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow and MLFlow' ☆120 Updated 2 years ago
- ☆37 Updated 3 weeks ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 Updated 7 years ago
- [Video] AWS Certified Machine Learning-Specialty (ML-S) Guide ☆121 Updated 5 months ago
- ☆40 Updated 7 years ago
- Building data models, data warehouses, and data lakes; automating data pipelines; and working with massive datasets. ☆13 Updated 5 years ago
- ☆9 Updated 5 years ago
- The goal of this repository is to detect the outliers for a dataset & see the impact of these outliers on predictive models ☆23 Updated 7 years ago
- A way for home buyers to know about factors affecting a state ☆48 Updated 6 years ago
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark ☆27 Updated 6 years ago
- Code supporting Data Science articles at The Marketing Technologist, Floryn Tech Blog, and Pythom.nl ☆70 Updated 2 years ago
- This is part of the Artificial Intelligence live course, hosted by Packtpub. In this repository, you can find information to build your e… ☆15 Updated 6 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model ☆36 Updated 6 years ago
- ☆25 Updated 7 years ago
- Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA mode… ☆15 Updated 6 years ago
- Contains source files used in the Spark with Python course ☆18 Updated 6 years ago