DeepHiveMind / Modern_E2E_ServerlessDataPipeline_NextGenDataLakeLinks
Next Gen Serverless E2E Data Pipeline & Workflow Orchestration | Modern Data Lake |
☆10Updated 5 years ago
Alternatives and similar repositories for Modern_E2E_ServerlessDataPipeline_NextGenDataLake
Users that are interested in Modern_E2E_ServerlessDataPipeline_NextGenDataLake are comparing it to the libraries listed below
Sorting:
- Blockchain framework/service: Hyperledger Fabric, AWS-QLDB & AWS-MSB☆13Updated 4 years ago
- Welcome to the wonderland of "AI" = f(DL, RL, DRL, ML, NLP, KG, MLOPS)☆24Updated 2 years ago
- Real-World AI/ML Ecosystem | Enterprise AI Platform Recipe | Custom MLOPS☆20Updated 2 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆25Updated 2 years ago
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆19Updated 7 years ago
- Project - Data Processing and Analysis in Python Course☆41Updated 6 years ago
- ☆87Updated 2 years ago
- Price Crawler - Tracking Price Inflation☆185Updated 4 years ago
- Docker with Airflow and Spark standalone cluster☆256Updated last year
- The web page for DataTalks.Club☆212Updated this week
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 2 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆84Updated 5 years ago
- Spark data pipeline that processes movie ratings data.☆28Updated this week
- ☆14Updated 5 years ago
- Data Engineering with Databricks Cookbook, published by Packt☆91Updated 11 months ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- ☆14Updated 2 years ago
- Databricks - Apache Spark™ - 2X Certified Developer☆266Updated 4 years ago
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations☆40Updated 4 years ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.☆139Updated 4 years ago
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here.☆106Updated 2 years ago
- This project aims to move the data from a Relational database system (RDBMS) to a Hadoop file system (HDFS)☆10Updated 3 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 2 years ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆22Updated 3 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆103Updated 4 years ago
- Data Engineering with AWS, Published by Packt☆328Updated 2 years ago
- ☆53Updated 4 years ago
- 4 different Big Datasets joined to get single table for final data analysis. Fraud Detection by taken consideration of different key feat…☆46Updated 4 years ago
- Simple ETL pipeline using Python☆26Updated 2 years ago
- Guide for databricks spark certification☆58Updated 3 years ago