aiplanethub / data-engineering
☆10Updated 2 years ago
Alternatives and similar repositories for data-engineering:
Users who are interested in data-engineering are comparing it to the repositories listed below
- ML Zoomcamp fall 2021 homework and stuff☆64Updated 3 years ago
- A Series of Notebooks on how to start with Kafka and Python☆154Updated last month
- Learning paths for data roles☆128Updated 4 years ago
- ☆87Updated 2 years ago
- Datasets for ML, Analysis, etc☆59Updated this week
- Here I will be exploring various tools and methods that are used in the data engineering process with Python.☆22Updated 4 years ago
- An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS ap…☆25Updated 2 years ago
- ☆34Updated last year
- Data Engineering with Google Cloud Platform, published by Packt☆116Updated last year
- Repository for Apache Spark course at Team Data Science☆16Updated 4 years ago
- Maternal Health Risk prediction MLOps pipeline☆43Updated 2 years ago
- Exercises performed as part of the ML Zoomcamp course☆30Updated 3 years ago
- Data engineering interviews Q&A for data community by data community☆63Updated 4 years ago
- Databricks Certified Associate Spark Developer preparation toolkit to set up a single-node standalone Spark cluster along with material in t…☆30Updated last year
- The getting started notebook for the DTC Zoomcamp Q&A challenge☆29Updated last year
- Full stack data engineering tools and infrastructure set-up☆51Updated 4 years ago
- Project for "Data pipeline design patterns" blog.☆45Updated 8 months ago
- Classwork projects and homework done through the Udacity Data Engineering Nanodegree☆74Updated last year
- data engineering 100 days 🤖 🧲 🦾 | #DE☆40Updated last year
- Solutions to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆56Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆74Updated 10 months ago
- A modern ELT demo using Airbyte, dbt, Snowflake, and Dagster☆27Updated 2 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆45Updated 5 years ago
- Recohut - Learn data engineering, data science☆96Updated last year
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- Repository for Data Engineering Zoomcamp 2024☆14Updated last year
- ☆40Updated 9 months ago
- I will attempt to create my own Spotify Wrapped by collecting data from the Spotify API, perform transformations and create informative d…☆74Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 8 months ago
- Found a data engineering challenge or participated in a selection process? Share with us!☆65Updated 2 years ago