arpit-mittal-ds / Data-ArchitectLinks

plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management system. You’ll create a relational database with PostGreSQL, design an Online Analytical Processing (OLAP) data model to build a cloud based data warehouse, and design scalable data lake architecture that meets the …

☆14

Alternatives and similar repositories for Data-Architect

Users that are interested in Data-Architect are comparing it to the libraries listed below

Sorting:

sankamuk / PysparkCheatsheet
PySpark Cheatsheet
☆36Updated 2 years ago
itversity / data-engineering-spark
☆88Updated 3 years ago
vim89 / datapipelines-essentials-python
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…
☆55Updated 2 years ago
BenSchr / Udacity-Data-Engineering-Projects
My solutions for the Udacity Data Engineering Nanodegree
☆34Updated 6 years ago
kislerdm / data-engineering-interviews
Data engineering interviews Q&A for data community by data community
☆64Updated 5 years ago
LearningJournal / Spark-Streaming-In-Python
Apache Spark 3 - Structured Streaming Course Material
☆125Updated 2 years ago
martandsingh / ApacheSpark
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…
☆103Updated last month
damklis / etljob
Simple ETL pipeline using Python
☆28Updated 2 years ago
hyunjoonbok / PySpark
PySpark functions and utilities with examples. Assists ETL process of data modeling
☆104Updated 4 years ago
immu0001 / Udacity-Data-Engineer-nanodegree
Classwork projects and home works done through Udacity data engineering nano degree
☆74Updated last year
RajenDharmendra / SparkQA
Apache Spark Interview Question and Answers
☆21Updated 5 years ago
rohitrsp898 / Basic_ETL_PySpark
☆21Updated 2 years ago
jleetutorial / python-spark-streaming
☆151Updated 7 years ago
shawlu95 / Data-Engineering-Toolbox
Lecture notes, lab notes, and links to helpful resources to pass Google Certification Exam for Professional Data Engineer.
☆18Updated 3 years ago
vivek-bombatkar / MyLearningNotes
Because its never late to start taking notes and 'public' it...
☆61Updated 5 months ago
manuel-lang / Data-Engineering-Nanodegree
Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…
☆57Updated 3 years ago
Saurav3218 / Pyspark_Questions_SKS
This repo is mostly created for pyspark and hive related interview questions.
☆48Updated 3 years ago
itversity / mastering-emr
GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers
☆24Updated 3 years ago
danieldiamond / data-engineering-capstone
Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development
☆21Updated 6 years ago
supratim94336 / DataEngineeringCapstoneProject
😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS
☆50Updated 6 years ago
raveendratal / ravi_azureadbadf
Ravi Azure ADB ADF Repository
☆64Updated 9 months ago
nareshk1290 / Udacity-Data-Engineering
Udacity Data Engineering Nano Degree (DEND)
☆187Updated 5 years ago
ajupton / big-data-engineering-project
Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR
☆88Updated 6 years ago
vivek-bombatkar / Spark-with-Python---My-learning-notes-
ETL pipeline using pyspark (Spark - Python)
☆116Updated 5 years ago
VyuWing-Learning / Data-Engineering-Bootcamp-Apache-Spark
☆13Updated 4 years ago
shravan-kuchkula / udacity-data-eng-proj2
A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…
☆24Updated 3 years ago
Realsid / databricks-spark-certification
Guide for databricks spark certification
☆58Updated 4 years ago
PacktPublishing / Mastering-Big-Data-Analytics-with-PySpark
Mastering Big Data Analytics with PySpark, Published by Packt
☆163Updated last year
shravan-kuchkula / udacity-data-eng-proj-1
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…
☆89Updated 3 years ago
dgadiraju / itversity-books
☆117Updated 5 years ago