henokyemam / Wrangling_PySparkLinks
☆11Updated 4 years ago
Alternatives and similar repositories for Wrangling_PySpark
Users that are interested in Wrangling_PySpark are comparing it to the libraries listed below
Sorting:
- Some of my sql projects with sqlite.☆10Updated 4 years ago
- Challenge Data Engineer☆25Updated 3 years ago
- Laptop Prices Predictor is an end-to-end data science project that accurately predicts laptop prices using machine learning algorithms. T…☆14Updated last year
- This is a guided certification project, as a part of Data Science for Social Good initiative☆17Updated 5 years ago
- All repository files for Metis Data Science Project 5 - Content-Based Recommender for E-Commerce☆12Updated 5 years ago
- Use Multiple Linear Regression, Python, Pandas, and Matplotlib to analyze the lifetime value and the key factors of the ‘Telco Customer C…☆13Updated 5 years ago
- ☆14Updated 3 years ago
- Analysis of over 300,000 Tweets about the Brisgeerton TV Series☆11Updated 4 years ago
- All Data Engineering notebooks from Datacamp course☆115Updated 5 years ago
- ☆21Updated 2 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆103Updated 2 months ago
- Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices☆131Updated 4 years ago
- The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such …☆122Updated 3 years ago
- Beginner's introduction to the pandas library for data manipulation☆28Updated 4 years ago
- Data Engineering on GCP☆39Updated 3 years ago
- ☆11Updated last year
- ☆12Updated 2 years ago
- IBM Data Engineering Courses from Coursera☆71Updated 2 years ago
- Analysis of new songs website data for extracting insights and business improvement.☆17Updated 3 years ago
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple …☆30Updated 5 years ago
- Git Repository☆148Updated 2 months ago
- ☆14Updated 3 years ago
- Recohut - Learn data engineering, data science☆98Updated 2 years ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Updated 4 years ago
- Content related to Mastering Postgresql along with videos.☆18Updated 4 years ago
- A guide to show you how to import data for ETL☆21Updated 2 years ago
- Airflow Tutorials☆25Updated 4 years ago
- SQL Tutorials using Jupyter Notebook☆17Updated 2 years ago
- In this personal Superstore Sales SQL Data Analysis project, an exploratory data analysis was performed on the Superstore Sales Data avai…☆34Updated 2 years ago
- Mastering Tableau 2021 published by Packt☆34Updated 3 weeks ago