aiplanethub / data-engineeringLinks
β10Updated 3 years ago
Alternatives and similar repositories for data-engineering
Users that are interested in data-engineering are comparing it to the libraries listed below
Sorting:
- Data engineering interviews Q&A for data community by data communityβ64Updated 5 years ago
- data engineering 100 days π€ π§² π¦Ύ | #DEβ40Updated 2 years ago
- β88Updated 3 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics whichβ¦β102Updated last month
- This repo is meant to make it really easy to analyze the interplays between health and social media use.β46Updated 3 years ago
- Resources for the free AWS Data Engineering course on youtubeβ102Updated 4 years ago
- β90Updated 2 years ago
- Recohut - Learn data engineering, data scienceβ99Updated 2 years ago
- A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, DBT, Big Query etcβ14Updated 3 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modelingβ104Updated 4 years ago
- Classwork projects and home works done through Udacity data engineering nano degreeβ74Updated last year
- I will attempt to create my own spotify wrapped by collecting data from the spotify API, perform transformations and create informative dβ¦β73Updated 2 years ago
- Learning paths for data rolesβ143Updated 3 weeks ago
- Duke MIDS: Data Engineering and DataOps Courseβ67Updated 9 months ago
- Azure Data Engineering Cookbook 2nd-edition, published by Packtβ32Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book.β59Updated last year
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatioβ¦β55Updated 2 years ago
- Hey this is the repo that has all the queries and data for my video game training series!β154Updated 3 years ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as stagingβ¦β93Updated 6 years ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineersβ24Updated 3 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.β42Updated 3 years ago
- This repo contains all the material developed during the 9-week bootcamp provided by DPhi in colaboration with DataTalks Clubβ22Updated 3 years ago
- Udacity Data Streaming Nanodegree Programβ23Updated 4 years ago
- A Series of Notebooks on how to start with Kafka and Pythonβ152Updated 8 months ago
- Full stack data engineering tools and infrastructure set-upβ57Updated 4 years ago
- β36Updated 2 years ago
- β18Updated 7 years ago
- Mastering Big Data Analytics with PySpark, Published by Packtβ163Updated last year
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,β¦β90Updated 3 years ago
- All Data Engineering notebooks from Datacamp courseβ115Updated 5 years ago