jeantardelli / data-engineering-with-pythonLinks
Here I will be exploring various tools and methods that are used in data engineering process with Python.
☆22Updated 4 years ago
Alternatives and similar repositories for data-engineering-with-python
Users that are interested in data-engineering-with-python are comparing it to the libraries listed below
Sorting:
- ☆35Updated 2 years ago
- Recohut - Learn data engineering, data science☆97Updated last year
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆100Updated 11 months ago
- ☆41Updated last year
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboard☆17Updated last year
- ☆87Updated 2 years ago
- An end-to-end project on customer segmentation☆82Updated 2 years ago
- This repo is meant to make it really easy to analyze the interplays between health and social media use.☆44Updated 3 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- ☆39Updated 2 years ago
- YouTube tutorial project☆106Updated last year
- Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.☆218Updated 2 years ago
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub.☆26Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated last year
- Mastering Big Data Analytics with PySpark, Published by Packt☆160Updated 10 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆80Updated last year
- All Data Engineering notebooks from Datacamp course☆115Updated 5 years ago
- PySpark Cheatsheet☆36Updated 2 years ago
- ☆15Updated 2 years ago
- Maternal Health Risk prediction MLOps pipeline☆44Updated 2 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆104Updated 4 years ago
- How to build and deploy an anonymization API with FastAPI and SpaCy☆71Updated 3 years ago
- Udacity Data Engineering Nanodegree Program☆52Updated 4 years ago
- ML Zoomcamp fall 2021 homework and stuff☆66Updated 3 years ago
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆14Updated 2 years ago
- Data Engineering with Google Cloud Platform, published by Packt☆118Updated last year
- A hands-on case study for demonstrating the stages involved in a machine learning project, from EDA to production.☆37Updated last year
- ☆142Updated 2 years ago
- Demo for CI/CD in a machine learning project☆107Updated 2 years ago
- The getting started notebook for the DTC Zoomcamp Q&A challenge☆29Updated last year